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AEVWCED  FESEARDi  PROJECTS  AGENCY 


SEMDWAJCVL  TECHNICAL  REPORT 
June  30,  1971 


1.  DfTROOUCriCN 

The  goal  of  this  project  is  to  provide  an  environment  for  high  quality 
research  activities  in  information  processing.  One  major  area  of  research 
is  mathematical  modeling  and  analysis  of  computer  systems.  Another  principle 

area  of  research,  which  is  closely  coupled  with  the  first,  is  measurement 

i 

of  oanputer  systems.  We  have  been  particularly  active  in  the  analysis  and 
measurement  of  time- shared  systems  and  the  ARPA  computer  communication  net¬ 
work.  Our  third  najor  area  of  responsibility  has  been  in  the  specification 
of  software  protocol  far  use  in  the  network. 

This  report  details  our  progress  frem  the  time  of  ow  last  Semiannual 
Technical  Report  of  August  15,  1970,  through  June  30,  1971.*  References  [1] 
to  [15]  include  work  accomplished  prior  to  the  current  reporting  period. 

In  Section  2  of  this  report,  we  survey  our  progress  in  the  modeling 
and  analysis  of  ccnputer  systems.  In  Section  2.1  we  discuss  the  work  on 
time-shared  systems,  and  in  Section  2.2  the  work  on  oerputer-oennuniaation 
networks.  A  major  effort  has  been  our  work  on  network  measurement,  and 
this  is  discussed  in  Section  3.  Progress  in  the  development  of  our  time¬ 
sharing  system  and  the  network  software  development  :  s  described  in  Section 
4.  Section  5  concludes  with  sane  general  comments  about  our  progress. 


*The  length  of  this  reporting  period  is  due  to  an  adjustment  made  to  oring 
the  Semiannual  Technical  Reports  and  Quarterly  Management  Reports  irto 
synchronization . 
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2.  ANALYTIC  MXELXNG  AND  fCASUFEJtUT  CF  COMPUTER  SYSTEMS 


Our  research  in  ccepuber  system.;  modeling  and  measurement  has  been 
mainly  in  the  areas  of  time-shared  scheduling  algorithms  and  coiputer- 
oarawniaation  networks. 

2.1.  Time-Shared  Systems  Analysis 

During  this  period  we  have  made  considerable  progress  in  our  research 
in  the  analysis  of  time-shared  systoos.  In  the  previous  Semiannual  ‘Techni¬ 
cal  Report  we  reported  on  efforts  by  Kleinrock  and  Mmtz  to  analyze  a  very 
general  class  of  scheduling  algorithms.  This  class  of  scheduling  algorithms 
includes  as  special  cases  most  of  the  algorithms  previously  studied  in  the 
literature  and  also  many  additional  algorithms.  FOr  exacple,  it  includes 
the  ca9e  of  a  multilevel  queueing  system  in  which  round-robin  is  used  at 
sane  intermediate  level  queue  (our  SEX  time-sharing  system  has  such  a  schedu¬ 
ling  algorithm) .  The  method  of  solution  required  analyzing  the  round- 
robin  system  with  bulk  arrivals,  and  this  was  aooenplished  by  Kleinrock, 

Mintz  and  Rodemich  (Ref.  (16]) .  This  paper  farms  Appendix  A  of  this  report. 
The  application  of  these  results  to  multilevel  queueing  systems  is  detailed 
in  Ref.  [17] ,  which  is  included  as  Appendix  B. 

During  this  past  year,  we  have  also  made  progress  in  the  area  of  bonds 
and  approximations  in  the  analysis  of  queuing  systems.  Recently,  Kleinrock, 
Muntz  and  Hsu  have  succeeded  in  finding  tight  upper  and  lower  bounds  on  the 
mean  response  time  far  a  given  amount  of  servioe  time  required  in  an  M/G/l 
queueing  system  with  an  arbitrary  scheduling  algorithm  which  does  not  make 
use  of  a  priori  knowledge  of  a  cus toner 's  servioe  time  requirement.  Given 
the  arrival  rate  of  customers  and  the  service  time  distribution,  bounds  are 
determined  such  that  the  response  function  for  any  scheduling  algorithm 
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oust  at  ail  points  lie  between  the  c  per  and  lower  bounds.  This  is  m 
important  result  since  it  places  nan- trivial  limits  an  what  can  be  aoaarp- 
lished  by  varying  the  scheduling  algorithm  for  a  sa sten.  Also  in  this  work, 
several  recessary  auditions  were  fowd  for  feasible  response  curves  in 
addition  to  the  ipeer  and  lower  bands.  The  results  of  this  work  are  to  be 
reported  at  the  1971  IFIPS  Congress,  Ljubljana,  Yugoslavia,  August  1971  (Ref. 
[18]) .  This  paper  is  inducted  as  Appendix  C  Mb  are  continuing  our  efforts 
in  this  area  with  the  aim  of  further  characterizing  feasible  response  functions. 

In  the  last  Semiannual  Ttechniaal  Report  it  was  mentioned  that  a  major 
unsolved  problem  in  aceputer  systems  analysis  is  the  consideration  of  niltiolc 
resources.  Fouad  Tbbagi,  a  graduate  studmt,  is  working  in  this  area  under 
the  direction  of  Kleinrrcfc.  He  is  investigating  the  application  of  some 
recent  results  from  the  literature  on  approximation  techniques  to  the  problem 
of  analyzing  networks  of  queues.  The  results  to  date  appear  promising  in 
providing  a  computationally  efficient  means  of  analyzing  queueing  networks 
in  which  there  is  a  limiting  resource  or  bottleneck. 

During  this  reporting  period,  we  have  begin  an  effort  in  the  area  of 
measurement  of  time-shared  oosputer  syst— .  Johnny  Mbng,  a  graduate  student 
working  with  Muitz,  has  begin  a  measurement  project  on  the  SEX  time-sharing 
system.  Based  on  measwients  of  proaess  execution  time  xequirsnmnts,  swap¬ 
ping  time  and  page  requiresants,  a  now  scheduling  algorithm  was  dailgred  \.id 
is  currently  being  is*  lamented.  Measurement  of  the  paging  behavior  of  pro¬ 
cesses  has  strongly  suggested  the  possibility  of  increasing  syscem  efficiency 
by  allowing  prooeae—  to  ocsauiioate  page  requirements  to  the  operating  system 
throu^i  system  calls.  A  preliminary  set  of  system  cells  has  been  implemented 
and  is  currently  being  evaluated.  This  spears  to  be  virginal  area  for  stuty. 
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2.2.  QaputeiKSaamnication  Nets 

Me  have  aontinued  a  strong  effort  lit  the  area  of  oaiputer-aanajm  cations 
networks  analysis  and  optimization.  In  particular,  major  areas  of  research 
are:  routing,  nodal  blocking  behavior  in  networks,  and  optimal  assignment 
of  channel  capacity, 

Gary  Fultz,  a  graduate  student  working  with  Kleinrodc,  has  used  mathe¬ 
matical  analysis  and  a  isolation  to  stuty  adaptive  routing  techniquse  in 
store- and- forward  oceputar  networks,  using  awns  mas  sags  delay  ee  a  mea¬ 
sure  of  network  performance,  a  nvmber  of  routing  algo ri them  have  been  evalu¬ 
ated.  The  ability  of  the  algorithms  to  adapt  to  aosnuniaaticn  line  failures 
has  bean  determined  by  simulation.  A  portion  of  this  effort  is  described  in 
Ref.  (19]  (Appendix  D).  Ref.  (20)  is  nearly  ample  ted  and  will  in  addition 
report  an  analysis  of  the  effects  of  nultipadcet  emeeagee  and  nodal  storage 
requirmants. 

An  important  and  difficult  problem  in  store-and- forward  networks  is 
that  of  nodal  bloddng.  Man  a  node's  buffer  storage  is  filled,  it  In  creme 
blocked  and  cannot  recall*  new  aaesagM.  This  puts  an  increased  load  on  this 
node's  neighbors  in  the  network  and  thus,  nodal  blocking  is  a  transient  ef¬ 
fect  Uiich  cm  propagate  in  time  and  spaas.  Kleinrock  and  graduate  studant 
Jade  Zaigler  have  studied  this  problem  and  their  results  are  reported  in 
Ref.  (21)  and  fef.  (22).  Ref  (21)  is  included  as  Appendix  e. 

Graduate  students  Mario  Qerla  and  Luigi  Fratta  have  worked  with  Klein- 
rock  an  amputstlanally  efficient  techniques  for  determining  the  optimal 
assignment  of  channel  capacity  in  a  cceputer-oocBuii cation  network.  The 
objective  is  either  to  minimi ae  deliy  with  the  total  aost  held  fined  or  to 
minimize  total  cost  with  the  de>l<'  held  fijmd.  under  the  aonditiens  of 
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negligible  nodal  blocking  and  fixed  network  topology  and  routing,  an  optimi¬ 
zation  algorithm  has  been  devised  which  is  significantly  more  efficient  than 
conventional  techniques  for  constrained  optimization  problems.  A  report  of 
this  work  is  now  in  preparation  (Ref.  U3] ) .  Further  work  in  this  area  will 
include  the  network  topology  and  routing  as  variables  in  the  optimization. 

At  the  OTP  Congress  71,  Professor  Wesley  Chu  will  present  a  paper 
dealing  with  the  selection  of  an  optimal  message  block  size  for  oonputer 
oamni cations  (Ref.  [24],  Appendix  F).  In  this  research  he  analyzes  the 
relationships  among  ocknowledgnent  time,  dusmel  transmission  rate,  channel 
error  characteristics,  anmrage  message  length  and  optimal  block  size.  Currently, 
Chu  is  aaepleting  a  study  of  demultiplexing  buffer  requirements  using  a  simu¬ 
lation  modal  (Ref.  (25)). 

Professor  Cantor  has  investigated  the  design  of  non-blocking  switching 
networks  with  a  mininun  muter  of  switches.  The  results  of  this  study  are 
included  as  Appendix  G  (Ref.  (26)). 

A  paper  surveying  various  aspects  of  the  optimization  of  oenputer-oomnu- 
ni cation  networks  was  presented  by  Kleinrock  at  the  1971  IIFE  National  Conven¬ 
tion  (tof.  (27),  Appendix  H). 

3.  tBIWORt  ICASHOPfTS 

The  network  measurement  activity  has  involved  a  variety  of  tasks,  includ¬ 
ing  the  further  development  of  the  measurement  tools,  "shakedown"  tests  on  the 
network  perfoxamnoe,  measurement  of  actual  user  traffic,  and  the  use  of  mea¬ 
surements  to  isprcMs  analytic  models  of  the  network  behavior  (Ref.  [28]). 

Each  of  these  areas  is  discussed  in  some  detail  in  the  following  paragraphs. 
Gerald  Cole  has  been  the  principal  participant  in  this  efiurt. 


3.1.  Extensions  to  the  Measurement  Capabilities 

The  acntrol  of  network  experiments  and  the  collection  of  measurement 
data  were  originally  developed  to  operate  in  a  stand-alone  (batch  operating 
system)  environment,  but  have  been  modified  to  also  function  under  the  SEX 
time-sharing  system.  This  change  allows  one  to  conduct  data  gathering  ejqoeri- 
raents  along  with  the  regular  interactive  usage  of  the  system,  and  it  provides 
the  basis  for  further  on-line  data  gathering  and  reduction  usage.  However, 
most  of  the  experiments  rrn  during  the  reporting  period  utilised  the  earlier 
system  due  to  the  large  computation  overhead  of  the  artificial  traffic  genera¬ 
tor.  This  overhead  is  particularly  large  since  the  generator  was  modified  to 
produce  pseudo-randan  message  lengths  and  interarrival  times  in  addition  to 
the  earlier  fixed  parameter  capabilities ,  but  the  random  generation  capabili¬ 
ties  proved  to  be  essential  for  many  of  the  experiments  which  were  conducted. 

3.2.  Analytical  Efforts  Related  to  Measurements 

Some  of  the  more  significant  results  of  the  measurement  efforts  to  date 
have  involved  the  creation  or  improvement  of  analytic  models  of  the  network 
behavior  based  on  insists  gained  from  experimental  measurement  data.  The 
modeling  and  measurement  efforts  were  foind  to  be  quite  aonplementary  and 
resulted  in  an  iterative  procedure  of  model  building  and  evaluation,  with 
feedback  from  each  test  resulting  in  a  more  acceptable  model.  Models  were 
developed  in  this  manner  relating  to  priority  handling  of  messages,  optimal 
packet  sizes,  and  the  separation  of  packet?  due  to  interference  traffic. 
Several  significant  improvements  were  made  in  the  models  based  on  observed 
discrepancies  between  the  observed  and  originally  predicted  behavior  and 
resulted  in  good  agreement  for  the  refined  models. 
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3.3.  Network  Experiments 

In  addition  to  the  measurements  related  to  analytic  models  as  described 
above,  several  experiments  were  run  to  measure  network  usage  and  to  attempt 
to  predict  the  network  performance.  The  first  of  these  tests  involved  the 
measurement  of  the  traffic  between  SHI  and  the  University  of  Utah  in  December 
of  1970.  Data  were  taken  during  several  hours  of  the  SHI  usage  of  the  PDP-10 
at  Utah,  and  these  data  were  correlated  with,  the  known  formats  and  activities 
involved  in  the  transactions.  In  this  manner,  we  were  able  to  gain  informa¬ 
tion  on  the  user  behavior,  and  at  the  sane  time,  verify  the  operation  and 
utility  of  the  measurement  routines. 

One  of  the  primary  concerns  in  the  analysis  of  the  SRI -Utah  traffic 
measurement  was  the  matter  of  how  many  such  users  the  network  could  simulta¬ 
neously  stpport.  A  rather  crude  estimate  was  made  based  on  Scherr's*  model 
of  user  think  time  and  "processing"  needr ,  and  resulted  in  a  range  of  50  to 
170  users  depending  on  the  file  transmission  requirements  of  each  user.  These 
interference  tests  were  extended  fay  use  of  artificial  traffic  and  produced 
saturation  levels  which  were  consistent  with  the  values  as  predicted  by 
Klsinrock's  results,**  and  led  to  further  investigations  of  cyclic  queueing 
phenomena  associated  with  RFtW  driven  traffic  cn  a  given  set  of  links.  The 
through-put  for  such  a  ccnditicn  was  also  investigated  as  a  function  of  the 
mxiber  of  links.  This  latter  test  resulted  in  an  interesting  demonstration 
of  several  of  the  measurement  techniques  in  resolving  a  discrepancy  between 
the  expected  and  measured  saturation  through-put. 

*  Scherr,  A.L. ,  "An  Analysis  of  Time-Shared  Corputer  Systems, "  The  MIT  Press, 
1967. 

**  Kleinrock,  L. ,  "Certain  Analytic  Results  for  Time-Shared  Processors," 

Proc.  IFIP  Congress  1968,  Edinburg,  Scotland,  pp.  D119-D125,  August  5-10,  1968. 
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3.4.  Coordination  with  BEN 


Severed  peculiar  effects  were  encountered  during  the  network  experiments 
which  were  eventually  found  to  be  "bugs"  in  the  network  itself.  Several  of 
these  effects  were  resolved  in  the  DP  system  that  was  released  in  mid-Novem¬ 
ber,  1970,  but  others  were  subsequently  encountered,  particularly  in  regard 
to  the  handling  and  measurement  of  moderately  high  traffic  loads.  This  lat¬ 
ter  problem  became  more  visible  after  we  requested  that  BEN  change  the  round- 
trip  delay  recording  resolution  from  0.1  to  0.8  msec,  to  avoid  a  register 
overflow  problem.  Subsequent  tests  showed  that  the  DP  would  "crash"  at 
certain  traffic  levels,  and  BEN  was  then  able  to  isolate  and  eliminate  the 
problem.  The  network  central  center  personnel  were  quite  helpful  in  these 
efforts  and  also  cooperated  in  the  execution  of  sane  of  the  subsequent  tests, 
e.g. ,  by  changing  selected  DP  parameter  values  during  a  test. 

3.5.  Measurement  Plans 

Measurement  plans  for  the  near  future  include  the  monitoring  of  network 
usage  as  the  new  protocol  b  acmes  operational  and  conducting  a  set  of  "before 
and  after"  tests  to  determine  the  effect  of  the  BEN  changes  in  the  flow  con¬ 
trol  and  routing  algorithm  which  will  soon  be  implemented.  Other  experiments 
will  also  be  run  to  further  evaluate  and  improve  sane  of  the  analytic  models, 
and  to  check  out  new  data  reduction  programs  ae  they  be  cone  available. 

4.  NETWORK  AND  SYSTEMS  90FTVjAFE 

This  section  covers  work  done  by  the  SPACE  Group  which  has  been  under 
the  leadership  of  Steve  Crocker  and  Jon  Postel.  The  effort  has  been  divided 
equally  between  maintaining  and  extending  the  SEX  time-sharing  system  and 
development  of  network  software. 
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4.1.  Network  Progress 

Three  major  meetings  of  the  network  Working  Group  (NWG)  were  held. 

Steve  Crocker  of  UCLA  was  Chairman  and  najor  organizer  of  these  meetings. 

In  conjunction  with  the  Fall  Joint  Oonputer  Conference,  the  NWG  met  in 
Houston  in  Noventer.  Discussion  there  centered  on  the  problems  of  console 
interaction  between  systems,  particularly  the  incompatibility  of  line- 
oriented  local  echo  devices  with  character-oriented  remote  echo  systems. 

It  was  agreed  that  this  incompatibility  would  prevent  acme  users  with  line- 
oriented  consoles  from  using  some  character-oriented  systems,  but  this  could 
be  tolerated.  The  NWG  held  a  February  meeting  at  the  University  of  Illinois. 
Network  protocols  were  the  topic  of  discussion.  The  primly  concern  was 
over  some  needed  refinements  to  the  H06T-H06T  or  lev  1  2  protocol.  A  special 
committee  chaired  by  Steve  Crocker  was  set  ip  to  resolve  these  issues  and  its 
report  (NWG/RPC  #107)  is  ar  official  modification  to  the  protocol.  This 
report  calls  for  new  oornand  formats,  new  aamands  (Reset,  Reset  reply) , 
replacement  of  narking  with  a  fixed  size  header,  and  the  introduction  of  byte 
sizes.  Also  discussed  at  the  Illinois  meeting  were  the  use  of  sockets  and 
the  initial  connection  protocol.  A  third  NWG  meeting  was  held  at  Atlantic 
City  in  conjunction  with  the  Spring  Joint  Ccrputer  Conference  in  May.  Prime 
topics  of  discussion  at  this  meeting  were  several  3rd  level  protocols,  e.g., 
Telnet-Logger,  File  Transfer,  and  initial  connection  procedures.  Oornnittees 
were  established  to  deal  with  each  of  these  topics.  The  Telnet-Logger  issues 
were  resolved  at  the  May  meeting  and  initial  connection  protocol  was  estab¬ 
lished  in  early  June. 

Implementation  of  a  Network  Control  Program  (NCP)  and  Telnet  and  Logger 
programs  which  follow  the  official  specifications  are  now  completed  and  oper- 
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ational  on  our  tine-sharing  system. 

The  SPADE  Group  has  provided  support  for  the  measurement  experiments 
conducted  by  Gerald  Cole.  The  measurement  programs  can  now  be  run  uncer 
the  SEX  system  in  parallel  with  other  network  and  local  use  of  the  system. 

4.2.  System  Development 

The  .'jEX  time-sharing  system  has  grown  to  support  more  of  the  users 
of  the  Sigrra-7.  We  have  acquired  to  IMLAC  PD6-1  display  terminals  and  four 
model  33  teletypes.  The  operating  system  has  been  changed  in  several  ways. 
Sane  of  the  changes  are  corrections  to  defects  in  the  system  as  acquired 
from  LRL.  The  file  system  and  resident  operating  system  were  made  more 
independent  so  that  a  system  crash  no  longer  causes  the  file  system  to  be 
destroyed.  A  garbage  collection  process  now  reclaims  lost  file  space  and 
forces  file  system  consistency.  An  interprocess  communication  facility 
called  Events  was  added.  A  batch  processing  facility  has  been  implemented 
to  provide  service  for  non-interactive  users  and  a  tape  input-output  super¬ 
visor  his  been  implenen tod .  An  operator's  control  program  has  been  imple¬ 
mented  which  allows  the  selective  starting  and  stopping  of  system  level 
programs,  e.g. ,  NCP,  printer  process.  The  system  is  currently  scheduled  for 
standard  user  service  20  hours  per  week.  Reliability  has  been  improved  to 
the  point  that  system  crashes  now  occur  on  the  average  of  once  per  week. 

5.  CONCLUSIONS  AND  SELF-EVALUATION 

Our  efforts  in  the  mothonatical  modeling  and  measurement  of  computer 
systems  has  been  very  profitable  during  this  period.  In  the  oorputer  net¬ 
works  area  particularly,  our  two-pronged  attack  with  analysis  and  measurement 
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has  yielded  significant  results.  Me  have  begun  a  measurement  effort  in 
connection  with  time- shared  systems  and  plan  to  accelerate  this  effort  in 
cooperation  with  our  analytic  work  in  this  area.  Progress  on  the  development 
of  network  protocol  las  been  substantial .  Developmental  work  on  the  SEX 
time-sharing  system  has  not  diminished  as  much  as  had  been  expected,  but  it 
has  been  a  necessary  and  worthwhile  investment.  We  plan  to  continue  shift¬ 
ing  more  oc  our  efforts  toward  the  modeling  and  measurement  areas. 

Our  efforts  have  established  UGLA  as  a  leader  in  the  field  of  modeling 
and  analysis  of  computer  systems  and  am  important  member  of  the  ARPA  Network 
community. 


11 


FETCFBXXS 


1.  Xleinrock,  L.  "Time-Sharing  Systans:  Aralytical  Methods,"  Proc.  of 

the  Symposium  on  Critical  Factors  in  Data  ]>ttMgaqervt/1968,  UCLA, 
rtur dTSfl-y,  Prend®HHall,  pp.  J-3J,  lSST  - 

2.  ffartin,  D. ,  and  G.  Estrin.  "Path  Length  Computations  on  Graph  Models 
of  Computations,"  Transactions  of  the  IEEE,  Vol.  C-18,  pp.  530-536, 

June  1969. 

3.  Kleinrock,  L.  "Models  for  Computer  Networks,"  Proc.  of  the  Tray  inter- 
national  Conference  an  Catmunications,  Boulder,  Oolo.,  pp.  21-16  to 

zrayraca  tothse - 

4.  Kleinrock,  L.  "On  Swap  Tine  in  Time-Shared  Systans,"  Proc.  of  the  TREE 
Computer  Group  Oonferenoe,  Minneapolis,  Minn.,  pp.  37-41,  June  17-19, 

5.  Coffman,  E.G.,  Jr.,  and  R.  R.  Muntz.  "Model  of  Pure  Time-Sharing  Disci¬ 
plines  far  Resource  Allocation,"  Proc.  of  the  24th  National  Conference 
of  API,  August  1969. 

6.  Chu,  W.  W.  "A  Study  of  Asynchronous  Tine  Division  Multiplexing  for 
Time-Sharing  Computer  Systans,"  1970  Proc.  of  the  Fall  Joint  Ocnputer 
Oonferenoe,  Las  Vegas,  Nev. ,  pp.  66^78,  November  5.96$. 

7.  Kleinrock,  L.  "Comparison  of  Solution  Methods  for  Ocnputer  Network 
Models,”  Proc.  of  the  Computer  and  Communications  Conference,  Rone,  N.Y., 
Oct.  2,  1$69. 

8.  Mints,  R.  R. ,  and  R.  Uzgalis.  "Dynamic  Storage  Allocation  for  Binary 

Search  Trees  in  a  TV*>-Level  Memory,"  Proc.  of  the  Fourth  Annual  Princeton 
Oonferenoe  on  Information  Sciences  and  Systans,  Princeton,  N.J.,  Marcfi 
2<5-27,  157(5. - - - 

9.  Kleinrock,  L.  "A  Continuum  of  Time-Sharing  Scheduling  Algorithms,” 

Proc.  of  the  1970  Spring  Joint  Computer  Conference,  Atlantic  City,  N.J., 

re:"TO'-3S8-,~itey  1376.  -  - 

10.  Kleinrock,  L.  "Analytic  and  Simulation  Methods  in  Computer  Network 
Design,"  Proc.  of  the  1970  Spring  Joint  Computer  Conference,  Atlantic 
City,  N.J.,  pp.  569-579,  Msiy  1^75. 

11.  Carr,  C.  S.,  S.  D.  Crocker,  and  V.  G.  Cerf.  "HOOT-HOST  Communication 
Protocol  in  the  APPA  Network,”  Proc.  of  the  1970  Spring  Joint  Computer 
Conference,  Atlantic  City,  N.J.,  pp.  589-597,  May  1970. 

12.  Chu,  W.  W.  "Selection  of  Optimal  Transmission  Rate  for  Statistical 
Multiplexers,"  Proc.  of  the  1970  IEEE  International  Oonferenoe  on  Com¬ 
munications,  San  Francisco,  Calif . ,  pp.  28-22  to  28-25,  June  8-10,  1970. 


12 


13.  Kle inrock,  L.  "Swap  Time  Considerations  in  Time-Shared  Systems, "  tetf 
Transactions  on  Corputaers,  pp.  534-540,  June  1970. 

14.  Kleinrock,  L. ,  and  R.  R.  Muntz.  "Multilevel  Processor-Sharing  Queueing 
Models  for  Time-Shared  Systems,"  Proc.  of  the  Sixth  International  Tele¬ 
traffic  Congress,  Minich,  Germany,  pp.  341/1-541/8,  August  1970. 

15.  Chu,  W.  W.  "Buffer  Behavior  for  Batch  Poisson  Arrivals  and  Single  Con¬ 
stant  Output,"  ™  Trans,  on  Oonmini cation  Technology,  October  1970. 

16.  Kleinrock,  L. ,  R.  R.  Muntz;  and  E.  Rodemich.  "The  Processor-Sharing 
Queueing  Model  for  Time-Shared  Systems  with  Bulk  Arrivals,”  to  be  pub¬ 
lished  in  Networks,  Vol.  I,  No.  1. 

17.  Kleinrock,  L. ,  and  R.  R.  Muntz.  "Processor-Sharing  Queueing  Models  of 
Mixed  Scheduling  Disciplines  for  Time-Shared  Systems,"  to  be  published 
in  J.  Assoc.  Oocputing  Machinery. 

18.  Kleinrock,  L. ,  R.  R.  Mjntz,  and  J.  Hsu.  "Tight  Bounds  on  the  Average 
Response  Time  far  Time-Shared  Oonputer  Systems,”  to  be  presented  at  the 
1971  International  Federation  for  Information  Processing,  Ljubljana, 
Yugoslavia,  August  23-28,  1971. 

19.  Fultz,  G. ,  and  L.  Kleinrock.  "Adaptive  Routing  Techniques  for  Store- 
and-Farward  Oooputer-Ccrmmi cation  Networks,"  1971  International  Confer 
ence  on  Oormunicationa ,  Montreal,  Canada,  June  14-16,  1971. 

20.  Fultz,  G.  "Adaptive  Routing  Techniques  for  Store-and-Forward  Message 
Switching  Oorputer-Ccmnunication  Networks,"  Ph.D.  dissertation,  School 
of  Engineering  and  Applied  Science,  Ccrputer  Science  Department,  Univer¬ 
sity  of  California,  Los  Angeles,  1971. 

21.  Zeigler,  J.,  and  L.  Kleinrock.  "Nodal  Blocking  in  Large  Networks,"  1971 
International  Conference  cn  Ccmunications,  Montreal,  Canada,  JUne  14-16, 
1971. 

22.  Zeigler,  J.  "Nodal  Blocking  in  Large  Networks,”  Ph.D.  dissertation, 

School  of  Engineering  and  Applied  Science,  Ccrputer  Science  Department, 
University  of  California,  Los  Angeles,  1971. 

23.  FTatta,  L. ,  M.  Gerla,  and  L.  Kleinrock.  "The  Slow  Deviation  Method:  A 
New  Approach  to  the  Analysis  and  Synthesis  of  Store-and-FOrwerd  Communi¬ 
cation  Networks, "  to  be  published  in  Networks. 

24.  Chu,  W.  w.  "Optimal  Fixed  Message  Block  Size  far  OQeputer  Oonrunications,” 
to  be  presented  at  the  1971  International  Federation  for  Information 
Processing,  Ljubljana,  Yugoslavia,  August  23-28,  1971. 

25.  Chu,  W.  W.  "Demultiplexing  Considerations  for  Statistical  Miltiplexors, " 
to  be  presented  at  the  ACM  Second  Synpoeiun  on  the  Problems  in  the  Opti¬ 
mization  of  Data  Oorrunicaticn  Systons,"  Palo  Alto,  Calif.,  Oct.  18-20, 
1971. 


13 


26.  Cantor,  D.  G.  "o  i -  Blocking  Switching  Networks,"  to  be  published 
in  Networks,  Vol.  i,  no.  1. 

27.  Kleinrock,  L.  "Delay  in  Ocnnunication  and  Oonputer  Networks,"  Trry  71 
International  Convention  and  Deposition,  New  York,  Muxh  22-25,  1971. 

28.  Cole,  G.  D.  "Ccnputer  Network  Measurements:  Techniques  and  Experiments," 
Ph.D.  dissertation,  School  of  Bigineering  and  Applied  Science,  Oonputer 
Science  Department,  university  of  California,  Los  Angeles,  1971 

29.  Hsu,  J.  "Analysis  of  a  Continuum  of  Prooessar-Sharing  Models  for 
Time-Shared  Gooputer  Systsns,”  Ph.D.  dissertation.  School  of  Engineering 
and  Applied  Science,  oonputer  Science  Department,  university  of  Cali¬ 
fornia,  Los  Angeles,  1971. 


Presentations 

1.  Kleinrock,  L.  "Survey  of  Analytical  Results  of  Time-Shared  Oonputer 
Systems,"  North  Carolina  Stats  university,  Raleigh,  Oct.  27,  1970. 

2.  Kleinrock,  L.  "Time-Shared  System®  Analysis,"  university  of  Illinois, 
Urbana,  Oct.  28,  1970. 

3.  Kleinrock,  L.  "Time-Shared  Systans  Analysis,”  Operations  Research 
Society  of  American  Conference,  Detroit,  Mich. ,  Oct.  20,  1970. 

4.  Kleinrock,  L.  "Recent  Results  in  Time-Shared  Systems  Analysis,"  Applied 
Physics  and  Information  Science  Department,  university  of  California, 

San  Diego,  Nov.  16,  1970. 

5.  Kleinrock,  L.  "Survey  of  Analytical  Methods  in  Queueing  Networks,"  New 
York  University,  Nov.  30,  1970. 

6.  Kleinrock,  L.  "Graphs  and  Networks  in  the  Real  World,"  International 
Conference  on  Circuit  Theory,  Atlanta,  Ga. ,  Dec.  16,  1970. 

7.  Kleinrock,  L.  Attended  ARPA  Contractors  Meeting,  San  Diego,  Calif., 

Feb.  9-11,  1971. 

8.  Kleinrock,  L.  "Time-Sharing  System  Modeling,”  ACM  SKJTDC  Group,  Systems 
Development  Corporation,  Santa  Monica,  Calif.,  March  8,  1971. 

9.  Muntz,  R.  "Bounds  on  the  Response  Characteristics  of  Queueing  Systems,” 
IBM  Yarktown  Heights,  N.Y.,  April  2,  1971. 

10.  Kleinrock,  L. ,  and  R.  R.  Mintz.  "Processor-Sharing  Queueing  Models  of 
Mixed  Scheduling  Disciplines  far  Time-Shared  Systems,"  39th  Meeting  of 
Operations  Society  of  America,  Dallas,  Tex.,  Mey  5-7,  1971. 


14 


APPENDICES 


15 


-APPEM3IX  A 


BC  PROCESSOR-SHRRIMG  QUEUEING  KEEL  FOR  TDC-6HMED 
SY9IB6  WITH  BUX  ARRIVALS 
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THE  PRXESSOR-GHAKING  QUXEING  MODEL  FOR  TM-SHAIED  SYSTEMS 

WITH  BULJC  ARRIVALS* 


by  L.  Kleinrock,  R.  R.  Muntz 

Ocrputer  Science  Department 
university  of  California,  Los  Angeles  California 

and 

E.  Rodemich 

Jet  Propulsion  Laboratory 
Pasadena,  California 


ABSTRACT 


Vie  consider  a  model  which  is  applicable  to  time-multiplexed  systems, 
such  as  multiplexed  aonnunication  channels  and  time-shared  ocnputing 
facilities.  In  this  (processor-sharing)  queueing  model,  all  jobs  currently 
in  the  system  share  equally  the  processing  capability  of  the  server.  In 
this  {Spar,  we  investigate  the  processor  sharing  model  for  the  case  of  bulk 
arrivals.  The  moan  response  time  of  the  system  as  a  function  of  required 
service  time  is  derived.  An  example  is  given  to  show  the  effect  of  bulk 
arrivals  versus  single  arrivals  for  a  constant  utilisation. 


*  This  work  was  supported  by  the  Advmtoad  research  Projects  Agency  of 
the  taper (mail  of  Defense  (DAHC-15-69-C-0285) .  This  paper  also  presents 
the  results  of  one  phase  of  research  carried  out  at  the  Jet  Propulsion 
Laboratory,  California  Institute  of  Technology,  under  Contract  No.  NAS  7-100, 
sponsored  by  the  National  Aeronaut  ire  and  Space  AcfcdnistraU.cn. 
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THE  PROCESSOR-SHARING  QUEUEING  MXEL  FOR  TIFE-SHAFED  SYSTEMS 

WITH  BULK  ARRIVALS* 

by  L.  Kleinrock,  R.  R.  Muntz 
Ounputcr  Sdenoe  Department 
university  of  California,  Los  Angeles,  California 

and 

E.  Rodemich 

Jet  Propulsion  Laboratory 
Pasadana,  California 

I.  INTHOCUCTICN 

In  a  tine-sharing  system,  the  acnputing  facilities  are  tine-multiplexed 
among  the  currently  active  jobs  according  to  sene  echeduling  discipline,  A 
major  goal  of  the  scheduling  discipline  is  to  provide  short  response  tines  to 
mail  requests  for  service.  This  creates  an  effective  environment  for  inter¬ 
action  between  a  user  at  a  oonsole  and  the  acnputing  facility  sinoe  most 
interactive  requests  are  for  relatively  snail  anoints  of  service.  The  user 
should  expect  longer  delays  if  his  request  is  for  a  significant  amount  of 
service.  Analytic  studies  of  these  systms  are  aimed  at  determining  the  effect 
of  various  scheduling  disciplines  on  response  time. 

In  oanputar  networks,  we  are  often  faced  with  a  configuration  in  which 
many  time-shared  ounputer  systems  are  interconnected  over  a  oamunioation  net¬ 
work.  It  is  inportant  to  wderstand  the  behavior  of  these  time-shared  nodes 
so  that  one  can  evaluate  the  performance  of  these  networks. 

The  application  of  queueing  models  to  time-shared  aenputar  systems 
has  been  an  active  area  of  research  sinoe  1964  [1] .  A  survey  of  this  ares 


*  This  work  was  supported  by  the  Advanced  Research  Projects  Agmcy  of 
the  D^xurtnmt  of  Defense  (DAHC-15-69-C-0285) .  This  paper  also  preamts  the 
results  of  one  phase  of  research  carried  out  at  the  Jet  Propulsion  Laboratory, 
California  Institute  of  Technology,  vndsr  contract  No.  NAS  7-100,  sponsored 


is  available  in  reference  [2] .  In  this  paper,  we  generalize  sore  previous 
models  by  permitting  bulk  arrivals  to  the  system. 

In  the  usual  romd-robin  scheduling  discipline,  a  newly  arriving 
job  oust  join  the  end  of  a  queue  for  the  server.  When  it  reaches  the  front 
of  the  queue,  it  is  allocated  a  quantum  of  time  on  the  server.  If  the  job 
completes  before  the  quantum  expires,  it  leaves  the  system.  Otherwise,  it 
must  rejoin  the  end  of  the  queue  to  wait  for  its  next  quantum.  In  this 
paper,  the  quantum  size  is  allowed  to  shrink  to  zero  so  that  we  have  the 

Li. 

"processor-sharing"  discipline.  In  effect,  each  job  receives  1/n  of 
the  processing  capability  of  the  processor  when  there  are  n  jobs  demanding 
service.  The  model  is  illustrated  in  Fig.  1.  This  proaesaor-sharing  dis¬ 
cipline  was  first  introduced  in  1967  [3]  and  analyzed  for  the  case  of 
single  Poisson  arrivals.  In  this  paper,  we  consider  the  bulk  arrival 
case  where  customers  may  arrive  in  groups.  The  arrival  instants  are, 
as  usual,  assumed  to  be  Poisson. 
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Aside  from  the  case  where  bulk  arrivals  may  be  the  actual  arrival 
mechanist!  far  the  system,  the  bulk  arrival  case  presented  here  arises 
naturally  in  solving  mare  general  models.  Consider  the  case  where  the 
server  becomes  unavailable  for  periods  of  time  with  distribution  F(d)  and 
mean  then  the  server  once  again  becomes  available,  he  finds  that  a 
number  of  new  customers  have  joined  the  system  and  so  it  appears  to  him  that 
a  bulk  arrival  has  taken  place.  Assure  also  that  the  interval  between  the 
end  of  one  of  these  "down”  times  and  the  start  of  the  next  is  exponentially 
distributed  with  mean  u. 

The  queueing  time  for  the  job  can  be  thought  of  as  the  sun  of  two  in¬ 
tervals!  the  time  between  the  arrival  instant  and  the  first  time  the  server 
becomes  available,  and  from  this  time  until  the  job  leaves  the  system.  The 

mean  length  of  the  first  interval  is  easily  calculated  to  be  »  ■  ■■  • 

2(3  ♦  u) 

Tb  find  the  mean  length  of  the  second  interval  we  use  the  following  approach. 
Vie  telescope  the  time  axis  to  reduce  all  of  the  server  "down"  intervals  to 
zero  length  but  keep  the  nutter  of  arrivals  the  same  in  each  interval.  On 
the  new  time  axis,  the  set  of  arrivals  leaving  a  down  time  aonears  as  a  bulk 
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arrival.  The  generating  function  for  the  bulk  size  is  easily  calculable. 

Now  solve  for  the  mean  waiting  time,  ^  for  a  job  in  this  "virtual"  time 

frame.  The  virtual  waiting  time  and  the  actual  waiting  time,  W, ,  ire  related 
w  ** 

by  W.  - ^ —  .  The  mean  nueueinq  time  for  a  job  is  just  the  sin  of  the 

A  l  -  u3 

means  of  the  two  intervals  described. 

Other  situations  in  which  servicing  of  a  class  of  jobs  is  interrupted 
can  be  apnroadied  in  this  manner.  For  exarrole,  orioritv  queueing  models 
with  arbitrary  service  disciplines.  The  approach  has  already  been  applied 
to  the  analysis  of  feedback  queueing  models  for  time-shared  systems  [4] . 


2.  MODEL 

Formally,  the  parameters  of  the  model  are: 

1)  the  lnfeerarrival  times  are  exponentially  distributed  with  the  moan 
interarrival  time  X,  i.o., 

P[ interarrival  tine  <  t]  ■  1  -  e“Xt  t  >  o 


2)  the  distribution  of  required  service  time  in  the  CPU,  B(t) 

-Bt 

|  1  -  CMtjC 

is  assured  to  be  of  the  form  B(t)  ■ 


1  -  o(t)e 
0 


o  <  t  <  t. 


where  q(t)  is  a  polynomial  in  t  of  degree  n.  This  of  course,  in¬ 
cludes  the  exponential  service  time  distribution  vhen  q(t)  -  1. 

3)  the  distribution  of  a,  the  bulk  size  is  arbitrary  with  mean  a. 


The  generating  function  of  a  is  G(z)  ■  £  z*Pla  »  i). 

i-o 


4)  the  queueing  discipline  is  processor- sharinq,  i.e. ,  the  limit  of 
the  round-tobin  discipline  as  the  quantun  size  approaches  zero. 
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Mb  want  to  solve  for  the  mean  queueing  tine  to  serviae  *  job  requiring 
t  seconds  of  serviae.  This  is  denoted  by  T(t). 

3.  AN  DFIBGRAL  EQUATION  INSCRIBING  AVERAGE  RESPONSE  TIME 

Mb  approach  this  problem  by  first  considering  a  discrete  tine  system 
with  quantum  size  q  >  0.  Mb  assume  that  arrivals  and  departures  take  place 
only  at  times  that  are  integral  multiples  of  q.  For  small  q  any  continuous 
distribution  can  be  approximated.  By  letting  q  approach  0  aquations  for 
continuous  time  systems  can  be  found. 

Let  n(iq)  ■  the  mean  number  of  jobe  in  the  system  with  iq  seconds  of 
attained  service  when  a  tagged  job  arrives. 

c  %  ■  the  probability  that  a  job  which  haa  received  iq  teoonda 
of  aarvioa  will  require  more  than  (i  +  l)q  aeoonda  of 
aarvica. 

a  -  the  mean  bulk  sirs  of  arrivals, 
b  -  the  mean  number  of  arrivals  with  a  tagged  job. 

Sines  we  intend  to  let  q  approach  0,  the  position  of  the  tagged  job 
with  respect  to  the  jobs  that  arrive  in  the  same  group  is  not  important.  Mb 
will  assume  far  convenience  that  the  tagged  job  is  the  last  job  in  the  group. 

The  mean  time  until  the  tagged  job  has  received  its  first  quart  am  of 
service  is  given  by 

T1  *  2  n  ♦  *1  ♦  q 
j-0 


22 


In  general,  the  earn  tine  between  the  (i  -  1)  end  i  quantun  of  service 
to  the  tagged  job  is  given  by 

m 

Ti  -Z  <“<H)oj0j+1  ...  o1+i.jq) 

j«o 

i-1 

♦  ^  (XiTj°0°l  •••  °i-j-2q) 

♦  q  ♦  bloQoi  ...  o^Jq  (1) 

Ihe  first  tauu  represents  the  tine  required  by  those  jobs  which  were 
initially  in  the  system  and  will  still  be  there  after  the  tagged  job  has 
reoeived  i-1  quanta  of  eervioa.  Ihe  second  tens  is  the  contribution  due 
to  jobs  that  have  arrived  since  the  tagged  job  entered  the  system.  Ihe 
third  term  is  due  to  the  tagged  job  itself.  The  last  term  results  from 
those  jobs  which  arrived  with  the  tagged  job  and  require  more  than  i-1 
qusnta  of  sarviae. 

Dividing  both  sides  of  Eq.  (1)  by  q  we  get 

«p  **  ^ 

"i"  ^  n<**)oj°j*.l  •••  °j+i-2 
i-1 

^  XiTj°0°l  —  °i-j-2 

♦  1  bo0°i  ...  ow  (2) 
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Let  lq  -  t  and  jq  ■  x.  than  as  q  Ot 


—  -  T'  (T)  = 


**•  0j4i-2  "I  -  B(x) 


1  -  B(X  4  T) 


n(jq)  ♦  n(x) 


...  Oj.j.2  ♦  1  -  B(x  -  x) 
aQa1  ...  oi-x  ♦  1  -  B(t) 


Therefore  as  q  -*■  0  Eq.  (2) 


T'  (x)  -  /  n(x) 
0 


1  -  B(x  4  T) 


TCT 


dx 


♦  Xa  f  T'  (x)  (1  -  B(r  -  x)  ]dx 


1  4  b[l  -  B(t)  ) 


Fran  Kleinrock  and  Ooffinsn  (6)  we  also  have  that 


n(x)  «  Xa(l  -  B(x)]T'(x) 


Substituting  for  n(x)  ws  have 


T'  (x)  »  Xa  f  T'  (x)  [1  -  B(x  ♦  x)  ]dx 


4  Xa  f  TT'  (x)  [1  -  B(x  -  x)  ldx 

^  A 


4  1  4  b[l  -  B(x)  ] 


(3) 
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In  terms  of  the  generating  function  G(z)  for  a,  we  have  that  a  *  g*  (1) . 
The  value  of  b  can  also  be  expressed  in  terms  of  r.(z) .  Consider  that  the 
tagged  job  is  selected  at  random  fran  the  arrivals  to  the  ourueinc  svstar. 
Then  the  probability  that  the  job  is  selected  from  a  bulk  size  of  n  jobs  is 

given  bv  (5) . 

a 


Therefore  b  ♦  1 


nP[a  -  nl 
a 


a 


G-(l)  ♦  G’(l) 
- 5*717“- 


or 


b 


G"  (1) 

gtTTT  • 


It  remains  only  to  solve  the  integral  equation#  Eq.  (3) . 


4.  SCLLfnOM  OP  THE  INTEGRAL  EQUATION 

In  this  section  we  solve  the  integral  equation  (3)  for  the  averaqe 
response  time  T(x) .  Recall  that  we  have  restricted  the  servioe  time  dis¬ 
tribution  B(t)  such  that 


With  1  -  B(t) 


o 


O  <  t  <  tj_ 


t  >  t 


1 


where  q(t)  is  a  polynomial  of  degree  n. 
Then  Bq.  (3)  becomes 


TMt) 


Xa  /*  ^  T  T'0c)q(x  «■  x)e"u(x+T,dx 
■'o 


♦  Xa  f  T*  (x)q(t  -  x)e“u(T"x)dx 

♦  bqtxJe"111  ♦  1  (4) 
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After  multiplying  to.  (4)  by  ewT,  it  may  be  rewritten  as 


t,-t 


eUTr  (t)  «  Xa 


Z1  e"uxq(T  + 


x)T'  (x)cbc 


L 


♦  Xa  /  e^atT  -  x)T‘ (x)dx 
'o 


4  bq(t)  -t  e 


UT 


(5) 


Let  D  denote  the  differential  operator  d/dx.  Differentiating  to.  (5) 
n  4  1  times,  we  get 

DrHl[eUTT'  (t)]  -  Xi  £  o(k,(0)rf1‘k[euxT'(T)J 

k-o 


-  Xi  f  -  t)]  4  u^V11 


k-o 

where  q  ^  is  the  k^  derivative  of  q.  Now,  multiplying  by  c-^1 ,  the  result 
can  be  put  in  the  form 


0_(D)T' (t)  4  0, (D)T' (t.  -  T)  -  u 
o  11 


n4l 


(6) 


where  0Q(D)  and  0^(0)  are  linear  differential  operators  with  constant  co¬ 
efficients,  given  by  the  following  formulas : 


0Q(O  -  (S  4  u)1*1  -  Xa  E  q™  (OMC  ♦  u> 

k-o 


n 


_(k) 


n-k 


01(C)  »  Xa  £  e"lltlq(k)(t1)(C  4  u)n_k 
k«o 
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Replacing  t  by  t^  -  t  in  Fg.  (6) ,  get 


01(-D)T,(t)  ♦  0o(~O)T  (tx  -  t)  -  u 

Apply  -O^tD)  to  this  ecruaticr,  0Q(-0)  to  (2),  end  add.  Then  we  have 
02(D)T'(t)  -  (0o(0)  -  O^OHi/*1 


where 


02(0  •  0o(C)0ol-C)  -  (-C) 


Since  02(U  is  unchanged  when  C  is  replaced  by  -£,  its  roots  occur  in 
pairs  (^,  -  c^),  m  ■  1,  n  +  1,  and 

02(C)  -  (-d^1  n  (c  -  o^) 

For  general  q( t)  ,  these  roots  are  distinct  and  nan-aero.  Then  the  general 
solution  cf  Eq.  (7)  is 

rvfl  -QT  a  T 

T'(t)  -a04  £  ♦  V  >  (0) 


where  the  oenstent  aQ  is  gtan  by 


aQ  «  (0o(0)  -  01(0))u  A^CO) 


which  can  be  reduced  to 


1  -  Xat 


where  t  is  the  scan  of  B(t) . 

A  formula  for  T'  (t^  -  t)  follows  fron  Eq.  (8)  by  replacing  T  by  -  T 
Using  these  eoqsressions  in  Eg.  (6) ,  and  eguatina  the  coefficient  of  each 
exponential  to  aero,  we  get  the  conditions 
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wv  ♦  v"%\v 


,  W  *  1 ,  •  •  •  , 


n  ♦  1 


which  wc  can  satisfy  by  putting 

%,  -  WV> 


•v, 


V“\ 


‘V 


w  *  1 »  •  •  •  $  n  ^  X 


Then 


r *  £  C»,VV,*"V  *  W^'  191 

This  expression  for  T'  (x)  must  ho  put  in  the  original  integral  Etj.  (4) 
to  determine  the  coefficients  Cj,  Collecting  the  coefficients 

of  the  various  exponentials  which  arise,  we  get  the  following  system  of 
equations: 


(u  -  (u  ♦ 


|  b,  1  -  1 

(  0,  1  •  2,  ...,  n  ♦  1 


(10) 


In  e<prt*ssinq  the  solution  o*  this  system  by  determinants,  only  the 
determinant  of  the  coefficients,  and  the  oof  actors  of  the  coefficients  with 
j  ■  1  are  needed.  These  can  he  given  exnHcitlv: 
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A  »  det 


1 


<u  -  <y  ’  (u  ♦  e^) 


T 


Wi  # • • • ,r*l 


i-(r*2)  0*1)  «n(n+l)  *n(r*l)  pi  5  o 

!  <-»  V- Viu  Ji'vv 

J  K 


r*l 


2  2.n+l 


and  a  typical  cof actor  la 

1 


A^  ■  det 


(V  -  am)l  (y  ♦  <^)5 


m,j«2,...,r*l 


i(i*2)  (m-l)  in 0*1) 

1  <-l)2  03. 


jn0*l) 


n  2  2 

•W  II  ‘  V 

1>k>2  3  * 


r*l 


2  _2,i*l 


2  &  <"  -  <> 


He  Im 


^cl*0o<ol> 


(Oj))  •  bAjj/A 


b(y2  -  a2)**1 

E  -  4» 


-b(u2  -  a2)"*1/  Q*  (a^ 


and  the  other  coefficients  have  similar  fooulas.  Using  these  in  Ep.  (9) , 
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t'  :t>  - 


1  -  Xat 


_b 

Xa 


<u2  -  a2)1*1 

W 


-a/r  -a  (t.-T) 

We  -  °i  V* 


(id 


w 


♦  e 


°i(V 


This  last  equation  is  the  solution  to  Eq.  (3)  which  we  aze  seeklnp.  It 
is  interesting  to  observe  that  for  the  nan-bulk  arrival  case  (i.e. , 

P(a  »  1)  ■  1) ,  Eq.  (11)  reduoe&  to 

T‘  w  -  r h 

where 

p  -  Xa/w 


This  is  the  veil-known  result  for  single  Poisson  arrival  to  a  round  robin 
prooessor  sharing  system  with  arbitrary  serviae  time  distribution  [7] . 


5.  AN  BUMPI£ 

Let  q(t)  -  1  and  t^  ■  so  that  the  service  times  are  exponentially 
distributed.  Then 

0o(O  -  C  ♦  V  -  Ai 
(^(O  -  0 

Q2(0  -  v2  -  2yXa  ♦  X2a2  -  ?2 
The  roots  of  02(Z)  are  t  (y-  Xa)  - 1  ax 
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Therefore 


T*  (t) 


1  -  Aa/u  Aa 


-o,t 


(u2  -  fl^)  (Oj  ♦  \i  -  Aa)e  1 


-2a, 


a^  +  u  -  Aa 


T(t) 


1  -  Xa/u  'a 


(U2  -  a2) <0l  +  y  -  x5)[e"°lT  -  l] 
(-20^)  (-a^)  (a^  +  y  -  Aa) 


or 

(U2  -  (u  -  Aa)2)  (1  -  e'(|l^)T) 

2(u  -  Xa)2 

Figure  2  shows  this  average  response  function  for  the  case  X  «  0.75, 
u  “  1*0,  a  ■  0.385,  b  ■  0.746.  The  parameters  in  this  exanple  were  chosen 
to  oorrejpond  with  an  exaiple  from  Reference  [4] .  Also  shown  is  the 
solution  for  the  non-bulk  arrival  case  with  the  sane  service  time  distribu 
ticn  and  the  same  mean  arrival  rate. 


T(t)  - - - - +  — 

1  -  Xa/u  Xa 


31 


2 


t 


3 


n-  1M.  #•* m,  •  jm.  b-.74C 
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6.  OONCIJUSICN 

The  processar-aharinq  queueinq  model  with  bulX  arrivals  has  been 
studied.  An  integral  equation,  Eq.  (3),  describing  the  mean  queueing  tune 
for  a  job  requiring  t  seconds  of  service  was  developed  which  is  valid  for 
arbitrary  service  tine  distributions.  This  inteqral  equation  was  solved  in 
Section  4  for  a  class  of  service  time  distributions  which  includes  the 
exponential  distribution.  The  application  of  the  bulX  arrival  model  to 
queueing  systems  with  periods  of  server  unavailability  were  indicated  in 
Section  1. 


33 


FEFEFQCES 


1.  Kleinrock,  L. ,  "Analysis  of  a  Time-Shared  Processor,"  Naval  Research 
Logistics  Quarterly,  Vol.  II,  No.  1,  pp.  59-73,  March  1964. 

2.  McKinney,  J.  M. ,  "A  Survey  of  Analytical  Time-Sharing  Models," 
Ocnputing  Surveys,  Vol.  1,  No.  2,  June  1969,  pp.  105-116. 

3.  Kleinzock,  L. ,  "Time-Shared  Systons:  A  Theoretical  Treatment,"  JACM, 
Vol.  14,  No.  2,  April  1967,  pp.  242-261. 

4.  Kleinrock,  L. ,  and  R.  R.  Muntz,  "Multilevel  Processor-Sharing  Queueing 
Models  for  Time-Shared  Systems,"  Proc.  of  the  Sixth  International 
Teletraffic  Congress,  Munich,  Germany,  pp.  541/1-341/8,  August  1570. 

5.  A vi- Itzhak,  B. ,  w.  L.  Maxwell,  and  L.  W.  Miller,  "Queueing  with 
Alternating  Priorities,"  Operations  Research  13,  No.  2,  1965. 

6.  Kleinrock,  L. ,  and  E.  G.  Coffman,  "Distribution  of  Attained  Service 
in  Time-Shared  Systems,"  J.  of  Ocnguters  and  Systems  Science,  Vol.  3, 
October  1967,  pp.  287-298. 

7.  Sakata,  M. ,  S.  Noguchi,  and  J.  Oizumi,  "Analysis  of  a  Processor- 
Shared  Queueing  Model  for  Time-Sharinr  Systems,"  Proc. ,  2nd  Hawaii 
International  Oonf.  on  System  Sciences,  January  1969,  pp.  625-628. 


34 


APPENDIX  B 


PROCESSOR-SHARING  QUEUEING  MXELS  OF  MIXES  SCHEDULING 
DISCIPLINES  FOR  TIMD -SHARES  SYSTEMS 

by  L.  ICLeinrock  and  R.  R.  Mintz 


35 


PROCESSOR-SHARING  QLEUEING  KXEIS  OF  MIXED  SCHEDULING  DISCIPLINES 

FOR  TDC-SHAHSD  SYSTEMS* 

by  L.  Kleinrock  end  R.  R.  Mintz 

Ccrputer  Science  Department 
university  of  California,  Los  Angeles,  California 

ABSTRACT 

Scheduling  algorithms  foe  time- shared  ccrputing  facilities  are 
considered  in  terms  of  a  queueing  theory  model.  The  extremely  useful  limit 
of  "processor-sharing"  is  adopted,  wherein  the  quantum  of  service  shrinks  to 
zero;  this  approach  greatly  simplifies  the  problem.  A  class  of  algorithms 
is  studied  for  which  the  scheduling  discipline  may  change  for  a  given  job  as 
a  function  of  the  anoint  of  service  received  by  that  job.  These  multilevel 
disciplines  form  a  natural  extension  to  irony  of  the  disciplines  previously 
considered. 

The  average  response  time  for  jobs  conditioned  on  their  service  require¬ 
ment  is  solved  for  in  tide  paper.  Explicit  solutions  are  given  for  the 
system  H/G/l  in  which  levels  may  be  first-ocme- first-served  (EOS) ,  feedback 
(FB)  or  roind-robin  (RR)  in  any  order.  The  service  time  distribution  is 
restricted  to  be  a  polynomial  tines  an  exponential  for  the  case  of  RR. 

Exanples  are  described  for  which  the  average  response  time  is  plotted. 
These  exanples  display  the  great  versatility  of  the  results  and  demonstrate 
the  flexibility  available  for  the  intelligent  design  of  discriminatory  treat¬ 
ment  among  jobs  (in  favor  of  short  jobs  and  against  long  jobs)  in  time-shared 
ccrputer  systems. 


•This  work  was  supported  by  the  Advanced  Research  Projects  Agency  of 
the  Department  of  Defense  (DAHC-1 5-69-0-02 85) . 
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PROCESSOR-SHARING  QIEIEDIG  KXE£  OF  MIXED  SOCDUUNG  DISCIPLINES 

FOR  TDC-SHAICD  SYSTEMS* 


by  L.  Kleinrock  and  R.  R.  Muntz 

Qxputer  Science  Department 
Uhiversity  of  California,  Los  Angeles,  California 


1,  DflRODUCTICN 

Queueing  models  have  been  used  successfully  in  the  analysis  of  time- 
shared  oonputer  systems  since  the  appearance  of  the  first  applied  paper  in 
this  field  in  1964  [1] .  A  recent  survey  of  this  work  is  given  by  McKinney  [2] . 
One  of  the  first  papers  to  consider  the  effect  of  feedback  in  queueing  systems 
was  due  to  Takacs  [3] . 

One  of  the  goals  in  a  time-shared  ccnputer  system  is  to  provide  rapid 
response  to  those  tasks  which  are  inter-active  and  which  place  frequent, 
but  small, demands  on  the  system.  As  a  result,  the  system  scheduling  algo¬ 
rithm  must  identify  those  demands  which  are  small,  and  provide  them  with 
preferential  treatment  over  larger  demands.  Since  we  assure  that  the 
scheduler  has  no  explicit  knowledge  of  job  processing  times,  this  identifi¬ 
cation  is  aoooqpiished  implicitly  by  "besting"  jobs.  That  is,  jobs  are 
rapidly  provided  small  amounts  of  processing  and,  if  they  are  short,  they 
will  depart  rather  quickly;  otherwise,  they  will  linger  while  other,  newer 
jobs  are  provided  this  rapid  service,  etc.,  thus  providing  good  response  to 
snail  demands. 

Generally,  an  arrival  enters  the  time-shared  system  and  capetes  for 
the  attention  of  a  single  processing  unit.  This  arrival  is  forced  to  wait 


•This  work  was  supported  by  the  Advanced  Research  Projects  Agency  of 
the  Department  of  Defense  (DAHC-15-69-C-0285) . 
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in  a  systan  of  queues  until  he  is  permitted  a  quantun  of  service  time;  when 
this  quantun  expires,  he  is  then  required  to  join  the  systan  of  queues  to 
await  his  second  quantun,  etc.  The  precise  model  for  the  systan  is  developed 
in  Section  2.  In  1967  the  notion  of  allowing  the  quantun  to  shrink  to  zero 
was  studied  (4]  and  was  referred  to  as  "prrceesor-sharing",  in  1966  Schrage 
[18]  also  studied  the  zero-quantun  limit.  As  the  none  implies,  this  sero- 
quantun  limit  provides  a  share  or  portion  of  the  processing  unit  to  many 
customers  simultaneously;  in  the  case  of  round-robin  (RR)  scheduling  [4] , 
all  customers  in  the  systan  simultaneously  share  (equally  or  on  a  priority 
basis)  the  processor,  whereas  in  the  feedback  (FB)  scheduling  [5]  only  that 
set  of  customers  with  the  anal  lest  attained  service  share  the  processor.  He 
use  the  term  processor-sharing  here  since  it  is  the  processing  unit  itself 
(the  central  processing  unit  of  the  cceputer)  which  is  being  shared  among 
the  set  of  the  customers;  the  phrase  "time-sharing"  will  be  reserved  to  isply 
that  customers  are  waiting  sequentially  for  their  turn  to  use  the  entire 
processor  for  a  finite  quantun.  In  studying  the  literature  one  finds  that 
the  obtained  results  appear  in  a  rather  ao^alex  form  and  this  oceplsxity 
arises  fran  the  fact  that  the  quantun  is  typically  assuned  to  be  finite  as 
opposed  to  infinitesimal.  When  one  allows  the  quantun  to  shrink  to  aero, 
giving  rise  to  a  processor -sharing  systan,  then  the  difficulty  in  analysis 
as  well  as  in  the  form  of  results  disappears  in  large  part;  one  is  thus 
encouraged  to  consider  the  processor-sharing  case.  Clearly,  this  limit  of 
infinitesimal  quantun*  is  an  ideal  and  can  seldom  be  reached  in  practice 
due  to  overhead  considerations;  nevertheless,  its  extreme  simplicity  in 
analysis  and  results  brings  us  to  the  studies  reported  in  this  paper. 

The  two  processor -sharing  systems  studied  in  the  past  are  the  RR  and 

*This  limiting  case  is  not  unlike  the  diffusion  approximation  far 
queues  (see,  for  exanple,  Gaver  (15)). 
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the  FT  (4# 5] .  Typically,  the  quantity  solved  for  is  T(t),  the  expected 
response  tine  conditioned  on  the  cue  toner 's  service  tine  t;  response  tine 
is  the  tine  fron  when  a  cus toner  enters  the  system  until  he  leaves 

ccroletely  serviced.  This  measure  is  especially  important  since  it  exposes 
the  preferential  treatment  given  to  short  jcbo  at  the  expense  of  the  long 
joes.  Clearly,  this  discrimination  is  purposeful  since,  as  stated  above, 
it  is  the  desire  in  tine-shared  systems  that  am&l)  requests  should  be 
allowed  to  pan  through  the  system  quickly.  In  1969  the  distribution  for 
the  response  tins  in  the  RR  system  was  found  [6] .  In  this  paper  we  consider 
the  case  of  mixed  scheduling  algor ittae  whereby  customers  arc  treated 
according  to  the  RR  algorithm,  the  FB  algorithm,  or  first  come  first  scivod 
(PCFS)  algorithn,  depending  upon  how  much  total  service  time  they  have  al¬ 
ready  received.  Thus,  as  a  cus  tern*.:  proceeds  through  the  system  obtaining 
service  at  various  rates  he  is  treated  according  to  different  disciplines; 
the  policy  which  is  applied  enong  customers  in  different  levels  is  that  of 
the  FB  system  as  explained  further  in  Section  2.  Thus,  natural  generaliza¬ 
tion  of  the  previously  studied  processor-sharing  system  allows  one  to 
exeats  a  large  ruaber  of  new  and  interesting  disciplines  whose  solutions 
we  present. 

A  acre  restricted  study  of  this  sort  was  reported  by  the  authors  in 
[16] .  Here  we  make  use  of  the  additional  results  from  [11]  to  generalize 
our  analysis. 


2.  THE  HOCEL 

The  model  we  choose  to  use  in  representing  the  scheduling  algorithms 
is  dram  fron  queueing  theory.  This  corresponds  to  the  many  previous  models 
studied  [1,2,4,5,6,7,8,18],  all  of  which  may  ha  thought  of  in  terms  of  the 
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struucare  shown  in  Pig.  2.1.  In  this  figure  Me  indicate  that  ntv  requests 
enter  the  system  in  queues  i^on  arrival.  Whenever  the  oaqxjter's  central 
processing  unit  (CPU)  becomes  free,  some  customer  is  allowed  into  the  service 
facility  for  an  amount  of  time  referred  to  as  a  quantum,  if,  during  this 
quartun,  the  total  aocvwulated  service  for  a  customer  equals  his  required 
service  tine,  then  he  departs  the  system;  if  not,  at  the  end 
he  cycles  back  to  the  system  of  queues  and  waits  wtil  he  is 
additional  service.  The  system  of  queues  may  order  the 
to  a  variety  of  different  criteria  in  order  to  select  the  next 
receive  a  quantun.  In  this  paper,  we  eeeunc  that  the  only  measure  used  in 
evaluating  this  criterion  is  the  amount  of  attained  service  (total  service 
so  far  received) . 

In  order  to  specify  the  scheduling  algoirthm  in  terms  of  this  model,  it 
Is  required  that  we  identify  the  following: 

a.  The  customer  interarrival  tine  distribution.  We  assime  this  to  be 
exponential,  i.e., 

P (interarrival  time  <  t)  «•  1  -  t  >  0  (2.1) 

where  X  is  the  average  arrival  rate  of  customers. 

b.  The  distribution  of  required  — rvioe  time  in  the  CPU.  This  we 
assume  to  be  arbitrary  (but  independent  of  the  interarrival  times) .  We  thus 
assure 

P (service  tine  <  x)  ■  B(x)  (2.2) 

Also  assure  1 /u  ■  average  service  time. 

c.  The  quantun  site.  Here  we  assuas  a  processor-shared  model  in  which 
cue  toners  receive  an  equal  but  vanishingly  email  anoint  of  service  each  time 
they  are  allowed  into  service.  For  more  discussion  of  such  system,  see 
(4,6,7,18) . 
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d.  The  system  of  queues.  We  oonsider  here  a  generalization  and  con¬ 
solidation  of  nary  systems  studied  in  the  past.  In  particular,  we  define  a 
set  of  attained  sezvioe  tines  {a^}  such  that 

0-ao<a1<«2<...  <^i<  Vl--  (2.3) 

the  discipline  followed  for  a  job  when  it  has  attained  service,  t,  in  the 
interval 

a^  <  t  <  a^  i  -  1,2, . . .  ,N  ♦  1  (2.4) 

will  lie  denoted  as  D. .  Me  consider  D,  for  any  given  level  i  to  be 
t  1  1 

eithert  FIRST  CDMZ  FIFST  SERVED  (FCFS) ;  PROCESSOR  SHARED-FB  (FB) ;  or 

ROtMD-RGBlN  PROCESSOR  SHARD  (RR) .  the  FOS  system  needs  no  emanation, 
the  FB  system  gives  service  next  to  that  customer  who  so  far  has  least 
attained  service;  if  there  is  a  tie  (among  X  customers,  say)  for  this 
position,  then  all  K  menbers  in  the  tie  get  served  simultaneously  (each 
attaining  useful  service  at  a  rate  of  1/K  sec/sec) ,  this  being  the  nature 
of  processor  sharing  systems.  The  RR  processor  sharing  system  shares  the 
service  facility  among  all  customers  present  (say  J  customers)  giving 
attained  service  to  each  at  a  rate  of  1/J  sec/sec.  Moreover,  between 
intervals,  the  jobs  are  tree  ted  as  a  set  of  FB  disciplines  (i.e.,  service 
proceeds  in  the  1th  level  only  if  all  levels  j  <  i  axe  enpty) .  See 
Pig.  2.2.  Fbr  exarplc,  for  N  -  0,  we  have  the  usual  single-level  case 
of  either  FCFS,  RR,  or  FB.  for  N  -  1,  we  could  have  any  of  nine  disci¬ 
plines  (FCFS  followed  by  FCFS,  ...»  RR  followed  by  RR);  note  that  FB  followed 
by  FB  is  just  a  single  FB  system  (due  to  overall  FB  policy  between  levels) . 

As  an  illustrative  exasple,  consider  the  N  -  2  case  shown  in  Fig.  2.3. 
Any  new  arrivals  begin  to  tfiare  the  processor  in  a  RR  fashion  with  all 
other  customers  who  so  far  have  lass  than  2  seccrtds  of  attained  service. 
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Customers  in  the  range  of  2  <  t  <  6  may  get  served  only  if  no  customers 
present  have  had  less  than  2  seconds  of  service;  in  such  a  case,  that  cus¬ 
tomer  (or  customers)  with  the  least  attained  service  will  proceed  to  occupy 
the  service  in  an  FB  fashion  until  they  either  leave,  or  reach  t  »  6,  or 
sene  new  customer  arrives  (in  which  case  the  overall  FB  rule  provides  that 
the  RR  policy  at  level  1  precepts) .  If  all  customers  have  t  >  6,  then 
the  "oldest”  customer  will  be  served  to  completion  unless  interrupted  by  a 
new  arrival.  The  history  of  some  customers  in  this  exanple  system  is  shown 
in  Fig.  2.4.  Me  denote  customer  n  by  C^.  Note  that  the  slope  of  attained 
servioe  varies  as  the  number  of  customers  simultaneously  being  serviced 
changes.  Me  see  that  Cj  requires  5  seconds  of  servioe  and  spends  14  seconds 
in  system  (i.e. ,  response  time  of  14  seconds). 

So  nuch  for  the  system  specification.  Me  may  summarise  by  saying  that 
we  have  an  ty<Vl  queueing  system*  model  with  processor  sharing  and  with  a 
generalized  nultilevel  scheduling  structure. 

The  quantity  we  wish  to  solve  for  is 

T(t)  -  E  (response  time  for  a  customer  requiring  a 

total  of  t  seconds  of  attained  servioe)  (2.5) 

Me  further  make  the  following  definitions: 

T^(t)  -  E(tini  spent  in  interval  i  (a^,a^) 
for  customers  requiring  a  total  of  t 
seooncs  of  attained  servioe)  (2.6) 


denotes  the  single-server  queueing  system  with  Poisson  arrivals 
and  arbitrary  service  time  distribution;  similarly  tyty l  denotes  the  more 
special  oase  of  exponential  service  time  distribution.  One  mi^it  also  think 
of  ouo:  processor-sharing  system  as  an  infinite  server  model  with  constant 
overall  servioe  rate. 
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We  note  that 


T  (t)  -  T^t')  for  t,t'  > 


Furthermore,  we  have,  for  a^_^  <  t  <  a^. 


that 


(2.7) 


T(t)  -  £  T,(t)  (2.8) 

i-1  1 

Alao  we  find  it  convenient  to  define  the  following  quantities  with  respect 
to  B(t) : 


txB(t)  ♦  x  J  <SB(t) 
x 


(2.9) 


3,./*  *».<«♦**/ 

O  X 


dB(t) 


(2.10) 


(2.11) 


"x  *  HI?.,,) 


(2.U) 


Note  that  Wx  represents  the  expected  work  fovnd  by  a  new  arrival  in  the 
system  W<V  1  where  the  service  times  are  truncated  at  x. 
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3.  RESULTS  FDR  MULTILEVEL  QUEUEING  SYSTP6 


Vsc  wish  to  find  an  expression  for  T(t) ,  the  mean  system  time  (i.e. , 
average  response  time)  for  jabs  with  service  time  t  such  that 
ai_^  <  t  <  «i#  i.e.,  jobs  which  reach  the  i  level  queue  and  there  leave 

the  system.  Tb  acocsplimh  this  it  is  convenient  to  isolate  the  i  level 
to  some  extent.  Me  moke  use  of  the  following  two  facts. 

1.  By  the  assumption  of  preempt Ive  priority  of  lower  level  queues  (i.e., 
FB  discipline  between  levels)  it  is  clear  that  jdbe  in  levels  hicjver 
than  the  i  level  can  be  ignored.  This  follows  since  these  jdbe 
cannot  interfere  with  the  servicing  of  the  lower  levels. 

2.  Me  are  interested  in  jobs  that  will  reach  the  i**1  level  queue  and 
then  depart  from  the  system  before  passing  to  the  (i  +  1) 8t  level.  The 
system  tine  of  such  a  job  can  be  thought  of  as  occurring  in  two  parts. 
The  first  portion  is  the  time  from  the  job's  arrival  to  the  queueing 
system  until  dm  group  at  the  i**1  level  is  serviced  for  the  first  time 
after  this  job  has  reached  the  1th  level.  The  second  portion  starts 
with  the  end  of  the  first  portion  and  ends  when  the  job  leaves  the 
system.  It  is  easy  to  see  that  both  the  first  and  second  portions  of 
the  job's  system  time  are  maffectad  by  the  service  disciplines  used 

in  levels  1  through  i  -  1.  Therefore,  we  can  assure  any  convenient 
disciplines.  In  fact,  all  these  levels  can  be  limped  into  one  equiva¬ 
lent  level  which  services  jobs  with  attained  service  between  0  and 
a^_j  seconds  using  ary  service  discipline. 

Prom  (1)  and  (2)  above  it  follows  that  we  can  solve  for  T(t)  for  jobs 
that  leave  the  syston  from  the  level  by  considering  a  two-level  system. 

The  lower  level  services  jobs  with  attained  service  between  0  and  a^ 
whereas  the  second  level  services  jobs  with  attained  service  between  a  ^ 
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cf 

and  a  .  Jobs  that  would  have  passed  to  the  i  +  1  level  after  receiving 
a^  seconds  of  service  in  the  original  system  are  now  assured  to  leave  the 
system  at  that  point.  In  other  words  the  service  times  are  truncated  at  a^. 

3.1  1th  Level  Discipline  is  FB 

Consider  the  two-level  system  with  the  second  level  corresponding  to 
the  i  level  of  the  original  ays  tan.  Since  we  are  free  to  rhoose  the 
discipline  used  in  the  lower  level,  we  can  assure  that  the  FB  discipline  is 
used  in  this  level  as  well.  Clearly  the  two-level  system  behaves  like  a 
single  level  FB  system  with  service  times  tnneated  at  a^.  The  solution 
for  such  a  system  is  Known  (5,9) : 


T(t) 


2(1  -  p<t) 


7 


(3.1) 


3.2  1th  Level  Discipline  Is  PCTS 

Consider  again  the  two-level  system  with  breakpoints  at  a^  and  a^. 

Regardless  of  the  discipline  in  the  lower  level,  a  tagged  job  entering  the 

system  will  be  delayed  by  the  sun  of  a)  the  work  currently  in  both  levels 

(■  W  )  plus,  b)  any  new  arrivals  to  the  lower  level  queue  during  the  interval 
ai 

(average  T(t)  )  this  job  is  in  the  system.  These  new  arrivals  form  a  Poisson 

process  with  parameter  X  and  their  contribution  to  the  delay  is  a  random 

-y 

variable  whose  first  end  second  moments  are  t  and  t *  respectively. 

^*i-l  ai-l 

Thus  we  have 

T(t)  -  W,  4  T(t)  ♦  t 

ai  ai-l 

and  so 
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T(t) 


(3.2) 


+  t 


wherp  Wa^  is  given  by  Bq.  (2.12) .  It  is  also  possible  to  use  these 
methods  for  solving  last-come- first-served  and  random  order  of  service  at 
any  level. 

3.3  1th  Level  Discipline  is  Wl 

In  this  case,  our  results  arts  limited  in  the  1th  interval  to  service 
time  distributions  in  which 

B(x)  -  1  -  p(x)e~tlx  a^  <  x  <  a^  (3.3) 

p(x)  «  PD  ♦  Pxx  ♦  •••  ♦  Pnxn  (3.4) 

Ajk. 

The  service  time  distribution  F(x) ,  for  this  i  interval  is  then 


P(x) 


B(a.  .  ♦  x)  -  B(a.  ,)  - 

1-1  1  —  -  1  -  q(x)«'flx  o  <  x  <  a 


1  ~ 

1 


i  "  Vl 


(3.3a) 


x  >  aA 


-  a, 


i-1 


trfierc 


*®*i«l 

•  P<\-i  ♦  x) 

<*<*>  -  - 1  I  ail  i -  -  %  ♦  V  4  —  4 


l  -  Bto^j) 


(3.4a) 


Thus  we  permit  in  this  interval,  service  time  distributions  of  the  form: 

1  minus  a  polynomial  of  degree  n  times  an  exponential.  The  analysis  of 
this  syston  appears  in  (11);  we  make  use  of  these  results  below.  Neverthe¬ 
less,  we  develop  our  analysis  as  far  as  possible  for  the  case  of  general 
B(x)  before  specializing  to  the  class  giver  by  Eqa.  (3.3)  (3.4). 
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Ws  start  by  considering  the  two-level  system  with  breakpoints  at  a^  ^ 
and  a^.  Consider  the  buBy  periods  of  the  lower  level.  During  each  such 
busy  period  there  may  be  a  nunber  of  jobs  that  pass  to  the  higner  level. 

We  choose  to  consider  these  arrivals  to  the  higher  level  as  occurring  at 
the  end  of  the  lower  level  txry  period  so  that  there  is  a  bulk  arrival  to 
the  higher  level  at  this  time.  We  also  choose  to  temporarily  delete  these 
lower  level  busy  periods  from  the  tine  axis.  In  effect  we  create  a  virtual 
time  axis  telescoped  to  delete  the  lower  level  busy  periods.  Sinoe  the  time 
from  the  end  of  one  lower  level  busy  period  to  the  start  of  the  next  is 
exponentially  distributed  (Pols sen  arrivals!) ,  the  arrivals  to  the  higher 
level  appear  in  virtual  time  to  be  bulk  arrivals  at  instants  generated  from 
a  Poisson  process  with  parameter  A. 

Consider  a  tagged  job  that  required  t  -  a^  ^  ♦  t  seconds  of  service 
(0  <  t  <  a^  -  a^_^) .  Let  be  the  mean  real  time  the  job  spends  in  the 
system  until  its  arrival  (at  the  end  of  the  lower  level  busy  period)  at  the 
iiigher  level  queue.  Let  c^(t)  be  the  mean  virtual  time  the  job  spends  in 
the  higher  level  queue. 

^  can  be  calculated  as  follows.  The  initial  dt-lay  is  equal  to  the 
mean  work  the  job  finds  in  the  lower  level  on  arrival  plus  the  a^  seconds 
of  work  that  it  contributed  to  the  lower  level.  Therefore 


W 


♦  At 


i-1 


•i-l**1 


♦  a 


i-1 


and  so 


*i-i 1 


•i-i 


•i-i  1 


(3.5) 
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In  the  tucker  level. 


If  ^(t)  is  the  naan  virtual  time  the  ]ch  spends 
we  can  easily  aonvert  this  to  the  moan  retd  tire  spent  in  thus  level.  In 
the  virtual  tire  interval  j^(t)  there  are  an  average  of  Xc^(t)  lower 
level  busy  periods  that  have  been  ignored.  Each  of  these  has  a  mean  length 

S-i 

of  t -  .  Therefore,  the  mean  real  tire  the  job  spends  in  the  higher 

'  p<ai-i 

level  is  given  by 


c^(t)  ♦  Xc^(t)  y  ~ 


i-1 


"i-l 


^(t) 

r^s: 


(3.6) 


"i-l 


acebinlng  these  results  we  see  that  a  job  requiring  t  -  «  ^  +  t 
seconds  of  servioe  has  mean  system  tire  given  by 

+  ”  ■  ITpL  1%.!  +  *1-1  +  °2'Tl|  <3-7) 

The  only  unknown  quantity  in  this  equation  is  cUj(t)  .  TO  solve  for 
^(t)  we  must,  in  general,  consider  an  tVC/l  system  with  bulk  arrival  and 
HR  processor  sharing.  The  only  exception  is  the  case  of  RR  at  the  first 
level  tdilch  has  only  single  arrivals.  Since  the  hiaher  level  queues  can  be 
intoned,  the  solution  in  this  exceptional  case  is  the  sane  as  for  a  round- 
robin  single  level  system  with  servioe  times  truncated  at  a^.  Therefore, 
frm  (8)  we  have  for  the  first  level 


T(t)  -  r - 1 —  0  <  t  <  a.  (3.8) 

1  "  p<ax  1 

Let  ub  now  consider  the  bulk  arrival  RR  system  in  isolation  in  order 
to  solve  for  the  virtual  time  spent  in  the  hitter  level  queue,  ^(t)  .  The 
service  time  distribution  for  this  bulk  arrival  system  is 
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F(x) 


♦  x)  -  Bta^) 

— i  -  bu^j - 


1 


0  <  x  <  -  ai_^ 


x  i  ai  '  ai-l 


Note  that  the  utilization  factor  for  this  bulk  system  is 

p  ■  (3.9) 


where  a  is  the  moan  nteber  of  arrivals  in  a  bulk  and 
the  distribution  F(x) .  Let  us  begin  by  solving  for  a. 


-  is  the  mean  of 
ui 


This  we  do  for 


the  general  case  a^,  a^.  a  is  just  the  mean  mnber  of  jobs  that  arrive 
during  a  low  level  busy  period  and  require  more  than  a^_1  seconds  of 
service.  Therefore  a  oust  satisfy  the  equation 


a  -  At  a  +  (1  -  B(a,  ,))1 
^i-1  1  1 


(3.10) 


In  this  equation  At  is  the  mem  nurber  of  jobs  that  arrive  during  the 

^i-l 

service  time  of  the  first  job  in  the  busy  period.  Since  each  of  these  jobs 

in  effect  generates  a  busy  period,  there  are  an  avenge  of  At  a 

^i-1 

arrivals  to  the  upper  level  queue  due  to  these  jobs.  The  second  term  is 
just  the  average  nwber  of  times  that  the  first  job  in  the  busy  period  will 
require  no  re  than  a^  seconds  of  service. 

Clearly  then 

_  1  -  B(a,  , ) 

a  "  I  -  p"  -  --  (3.11) 

**1-1 

In  (11),  an  integral  equation  is  derived  which  defines  <^(t)  for  the 
RR  bulk  arrival  system;  we  repeat  that  equation  below 
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0.12) 


®2 (t)  ■  Xa  J  o£(x)(l  -  F(x  ♦  i))tbt 
o 

i 

♦  tiif  ai(x)  (1  -  r(T  -  x)]dx 

o 

♦  1  ♦  bll  -  F(T)) 


where  a^(r)  -  do^x )/dr,  and  b  -  naan  ranter  of  arrivals  with  the  tagged 
job.  The  solution  to  this  integral  equation  for  the  restricted  service 
tine  distributions  as  given  in  Bqs.  (3.3a)  and  (3 .4a)  is  also  given  in  (11] , 
and  for  our  problem  takas  the  form 


OjW 


-v  t  “VS  YmT 

.  WU  *  *  m  ]  “  °i(Yn>#  l* 

<W  ♦ 


1) 


(3.13) 


where 


-  *1  -  Vi 


9o(x) 


-  (X  ♦ 


8)ml  -  Xa 


qU)  (o)  (x  ♦ 


0) 


n-k 


°l(x)  "  x«  £  e^'q^  (xx)  (x  ♦  X)"* 
k^> 

°2(x)  "  V^V"*0  * 


(3.14) 

0.15) 

(3.16) 

(3.17) 
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and  where  Eq.  (3.17)  has  roots  (occurring  in  pairs)  x  -  -r  ,  Y  for 

ffl  to 

r  »  1,  2,  n  +  1  and  the  notation  f  ^  (y)  denotes  the  deriva¬ 

tive  of  f  with  respect  to  its  argvsaent  evaluated  at  a  value  y. 

In  the  solution  for  c^(t)  given  in  Bq.  (3.13) ,  we  are  required  to 
oaepute  b(-  naan  rxnber  of  arrivals  with  a  tagged  job) .  This  we  do  by 
first  deriving  an  expression  for 

m 

G(s)  E  £  Ptbulk  sias  -  ytzk  (3.18) 

kmo 

t^iich  is  the  probability  generating  function  (z-transform)  for  the  bulk 
else.  Either  by  direct  arguments  based  upon  busv  periods  or  by  use  of  the 
method  of  collective  marks  [12] ,  wa  readily  arrive  at 

__  (la.  « )  3  -Xa.  «  . 

G(t)  -  (1  -  B(«i.l,1*£  “j|“«  tG(*))3 

♦  »<Vi>[/  1  1  E  o.ui 

•  • 

In  »|.  (3.19)  the  first  tern  is  oonditlonad  an  ths  assunptlan  that  the 
customer  who  pcentpts  service  from  those  at  level  i  reachee  the  1th  level; 
the  eaoond  tern  assumes  that  he  doss  not  reach  level  i.  Bq.  (3.19)  reduces 
to 

-Xa,  ,tl-G(*)}  /i-1  -Xt(l-G(*)J 

G(z)  -  (1  -  B(«.  ,))*a  11  ♦  /  a  dB(t)  (3.20) 

*o 

for  arbitrary  B(x) ,  wa  onset  reduce  this  last  expression  any  further. 
Nevertheless,  we  can  obtain  moments  from  it.  m  particular,  from  ths 
definition  of  a,  we  obtain 
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a  -  a' (2) 


1  ■  B(ai-1) 


2-1  1  -  Xt 


<a 


i-1 


which  is  exactly  as  obtained  by  more  direct  arguments  in  Bq.  (3.11). 
However,  we  are  seeking  b.  For  this  we  most  calculate 


6"  (z) 


z-1 


(a)2 


<a 


i-1 


[ 


2Xai-l(1  *  P<a 


i-1 


2T 
)  +  X  V 


«  (3. 2i) 

^i-lJ 


Now  since  the  mean  group  size  (1  +  b)  of  a  tagged  customer's  group  is 
related  to  the  bulk  size  distribution  as  the  mean  spread  is  related  to  the 
inter-event  distribution  (namely  the  mean  spread  equals  the  second  moment 
of  the  inter-event  interval  divided  by  the  first  morvait)  [13] ,  we  have 


.  ,  .  _  second  moment  of  bulk  size 
first  moment  hulk  size 


(3.22) 


or 


b  - 


G"(z) 

STST 


I 

z-1 


(3.23) 


From  Bq.  (3.20)  we  get 


-  SJ  +  xV«  J 


(3.24) 


Having  solved  for  o,(t)  we  ray  new  substitute  back  into  Fq.  (3.7) 
which  solves  the  case  when  the  i**1  level  discipline  is  HP  and  service 
time  is  of  the  form  given  in  Eqs.  (3.3a)  and  (3.4a) .  (Note  that  for  i  -  1, 
the  solution  given  in  Eq.  (3.8)  is  good  for  any  B(x).) 
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It  is  instructive  to  display  the  solution  for  T(t)  explicitly  in  a 
til 

special  case  for  our  i  level  KR  discipline.  We  choose  the  multilevel 
system  with  ty/ty/l  and  solve  for  T(a^_1  +  t)  after  substituting  c^(x) 
into  Eq.  (3.7).  Note  for  jyty'l  that  q(t)  =  =  1.  Also,  frcm  Eqs. 

(3.14  -  3.17) ,  and  choosing  6  =  y, 

Qo(x)  -  x  +  y  -  Xa  (3.25) 

-yx. 

Qx(x)  -  Xae  -1  (3.26) 

,  ,  -2yx.  , 

Q2(x)  -  y  -  2yXa  +  (Xa)  (1  -  e  A)  -  (3.27) 

thus  the  roots  of  Q2  (x)  are 

/  2  -  -  2  “2yx. 

±  Yx  -  ±  /  y  -  2yXa  +  (Xar(l  -  e  A)  (3.28) 


and 

1  1  -yxl 

i  -  i(l  -  e  A)  (3.29) 

M 

thus  frcm  these  and  Eg.  (3.13),  we  get 
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Also  from  B^s.  (2.9)  and  (2.10)  we  obtain 


t  .  i(l  -  e"wx> 
<x  S'  ’ 


(3.31) 


7  .  2,1  -  .-•«  -  p,.-**) 

X  XT 


(3.32) 


We  may  substitute  these  last  two  equations  into  Eqs.  (3.11)  and  (3.24)  to 
obtain  a  and  h  explicitly.  Also,  we  note  from  Eqs,  (2.12)  and  (3.  32)  that 


Ml  -  e 


-^i-l 


-ya 


w 


-  1*1-1® 


i-1 


) 


i-1 


,  \  -va,_. 

y2[l-i(l-e  1X)1 


(3.33) 


Finally,  we  may  substitute  this  expression  for  W  and  Eq.  (3.30)  which 

ai-l 

gives  a^t)  into  Bq.  (3.7)  which  gives  vs  the  explicit  form  for  T(t). 


4.  EXAMPLES 

In  this  section  we  demonstrate  through  exanples  the  nature  of  the 

l 

results  we  have  obtained.  Recall  that  we  have  given  explicit  solutions  for 
our  general  model  in  the  case  fyG/1  with  processor  sharing  where  the  allowed 
scheduling  disciplines  within  a  given  level  may  be  PCFS  or  IB;  if  the  dis¬ 
cipline  is  RR,  it  may  be  at  level  1  and  if  it  occurs  at  level  i  >  1,  must 
be  of  the  form,  given  in  Eqs.  (3.3a)  and  (3.4a) . 

We  begin  with  four  exanples  from  the  system  tyWl.  As  mentioned  in 
Section  2,  we  have  nine  disciplines  for  the  case  N  =  1.  This  comes  about 
since  we  allow  any  one  of  three  disciplines  at  level  1  and  any  one  of  three 
disciplines  at  level  2.  As  we  have  shown,  the  behavior  of  the  average 
conditional  response  time  in  any  particular  level  is  independent  of  the 

i 

discipline  in  all  other  levels;  thus  we  have  nine  disciplines.  In  Fig.  4.1 
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we  show  the  behavior  of  each  of  the  nine  disciplines  for  the  system  N  =  1. 

In  this  case  we  have  assumed  y  =  1,  X  =  0.75,  and  a^  =  2.  From  Eg.  (3.1) 
we  see  that  the  response  time  for  the  FB  system  is  completely  independent 
of  the  values  a^  and  therefore  the  curve  shown  in  Fig.  4.1  for  this  re¬ 
sponse  time  is  applicable  to  all  of  our  M/M/1  cases.  Note  the  inflection 

» 

point  in  this  curve  and  that  the  response  time  grows  linearly  as  t  °° 

(a  phenomenon  not  observable  from  previously  published  figures  but  easily 
seen  from  Eq.  (3.1)).  As  can  be  seen  from  its  defining  equation,  the  re¬ 
sponse  time  for  PCFS  is  linear  regardless  of  the  level;  the  RR  system  at 
level  1  is  also  linear,  but  as  we  see  from  this  figure  and  from  Eq.  (3.13) 
the  RR  at  levels  i  >  1  is  nonlinear.  Thus  one  can  determine  the  behavior 
of  any  of  nine  possible  disciplines  from  Fig.  4.1.  Adiri  and  Avi -Itzhak  con¬ 
sidered  the  case  (FB,  RR)  [14] . 

Continuing  with  the  case  ty'fy'l,  we  show  in  Fig.  4.2  the  case  for  N  =  3 
where  =  RR,  d2  =  FB,  =  PCFS,  and  D4  =  RR.  In  this  case  we  have 
chosen  a^  =  i  and  y  =  1,  X  =  0.75.  Vfe  also  show  in  this  figure  the  case 
FB  over  the  entire  range  as  a  reference  curve  for  ocnparison  with  this  dis¬ 
cipline.  Note  (in  general  for  M/M/1)  that  the  response  time  for  any  disci¬ 
pline  in  an  given  level  must  either  coincide  with  FB  curve  or  lie  above  it 
in  the  early  part  of  the  interval  and  below  it  in  the  latter  part  of  the 
interval;  this  is  true  due  to  the  conservation  law  [15] . 

The  third  example  for  fyM/1  is  for  the  iterated  structure  =  PCFS. 

Once  again  we  have  chosen  y  =  1,  X  =  0.75,  and  a^  =  i.  Also  shown  in  this 
figure  is  a  dashed  line  corresponding  to  the  IB  system  over  the  entire  range. 
Clearly,  one  may  select  any  sequence  of  FB  and  FCFS  with  duplicates  in 
adjacent  intervals  and  the  behavior  for  such  systems  can  be  found  from 
Fig.  4.3.  It  is  interesting  to  note  in  the  general  M/G/l  case  with  =  PCFS 
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that  we  have  a  solution  for  the  FB  system  with  finite  quantum 
where  preemption  within  a  quantum  is  permitted! 

Our  fourth  exairple  is  for  an  10V1  system  with  =  RR  and  is  shown 
in  Fig.  4.4.  Here  we  use  the  explicit  form  for  T(t)  derived  from  Eqs. 
(3.7),  (3.  30),  and  (3.33).  We  maintain  the  same  value  m  =  1,  X  =  0.75, 
a^^  *  2,  a2  =  5.  T(t)  for  this  sytem  is  shown  in  Fig.  4.4. 

We  leave  ty/tyO.  now  and  give  two  exanples  far  M/Q/l.  For  the  first 
example  we  choose  the  system  Vi/E^/1  with  N  -  1.  In  this  system  we  have 


Sgi-  (2rt2xe'2wt  x  >  0  (4.1) 

We  note  that  the  mean  service  time  here  is  again  given  by  1/u;  the  second 

2 

moment  of  this  distribution  is  3/2m  .  We  calculate 


1 


1  *“2ya7  2 

Je  X[1  +  2ms.  +  2^)^] 


(4.2) 


We 


choose  the  system  N  *  1  with  =  KR  and  D2  =  FCFS. 


For  the  cases 


a.^  =  l/2y,  1/m,  2/y,  4/m,  and  00  with  m  ■  1  and  X  =  0.75  we  show  in 
Fig.  4.5  the  behavior  of  this  system. 

The  last  exairple  we  use  is  for  the  following  service  time  distribution: 


dB.  (x) 

bi<*> 5  -Sr- 


=  < 


-2(x  -  j 


e 


°<x<? 


7<X 


(4.3) 


as  shown  in  Fij.  4.6.  In  this  case,  t  1  *  |>  =  i,  t  =  |>  ?  =  j 

K2  *2 
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We  choose  the  system  =  PCFS,  ~  HR'  and  =  PCFS  with  =  i, 

3 

a^  =  and  A  =  0.75.  The  performance  of  this  system  is  given  in  Fig.  4.7. 

These  examples  demonstrate  the  broad  behavior  obtainable  from  our 
results  as  one  varies  the  appropriate  system  parameters.  In  all  cases  the 
system  discriminates  in  favor  of  the  short  jobs  and  against  the  longer  jobs. 

5.  OOMCLUSICN 

Our  purpose  has  been  to  generalize  results  in  the  modelling  and  analysis 
of  time-shared  system.  The  class  of  systems  considered  was  the  processor¬ 
sharing  systems  in  which  various  disciplines  were  permitted  at  different 
levels  of  attained  service.  The  principle  results  far  M/tyi  are  the 
following:  (1)  the  performance  far  the  FB  discipline  at  any  level  is  given 
by  Eg.  (3.1) ;  (2)  the  performance  for  the  PCFS  discipline  is  linear  with  t 
within  any  level  and  is  given  by  Eq.  (3.2);  (3)  the  performance  for  the  RR 
discipline  at  the  first  level  is  well  known  [8]  and  is  given  by  Eq.  (3.8) ; 

(4)  an  integral  equation  for  the  average  conditional  response  time  for  RR 
at  any  level  (equivalent  to  bulk  arrival  RR)  is  given  by  Eq.  (3.12)  and 
remains  unsolved  in  general;  however,  far  the  service  distribution  given  in 
Eqs.  (3.3a)  and  (3.4a),  we  have  the  general  solution  given  in  Eq.  (3.13)  as 
derived  in  [11] .  We  further  note  that  the  average  conditional  response  time 
at  level  i  is  independent  of  the  queueing  discipline  at  all  other  levels. 

Exanples  are  given  which  display  the  behavior  of  some  of  the  possible 
system  configurations .  From  these,  we  note  the  great  flexibility  available 
in  the  multilevel  systems.  From  the  exanples  in  Section  4,  we  see  that 
considerable  variation  from  previously  studied  algorithms  is  possible  so 
long  as  the  number  of  levels  is  less  than  a  snail  integer  (say  5) ;  however, 
we  see  that  as  N  increases ,  the  behavior  of  the  ML  systems  rapidly  approaches 
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that  of  the  pure  FB  system. 

Examination  of  the  envelope  of  the  multitude  of  response  functions 
available  with  the  ML  system  has  suggested  that  upper  and  lower  bounds  in 
systan  performance  exist;  this  in  fact  has  been  established  and  is  reported 
in  [19]. 
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.  Figure  2.1.  The  Feedback  Queueing  Model 
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Figure  2.3.  Example  of  N  ■  2 
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Figure  4.4.  Response  Time  for  Example  of  D,  ■  RR,  M/M/1, 
M  -  1.0,  X  -  0.75,  e,  -  2.0, «,  -  6.0 
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Figure  4.4.  Responae  Time  for  Example  of  D,  “  RR,  M/M/1, 
H  ■  1.0,  X  -  0.7S,  a1  ■  2.0,  aj  ■  6.0 
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TIGHT  BOUNDS  CM  THE  AVERAGE  RESPONSE  TIME  FOR 
TUG-SHARED  OfVIER  SYSTQ6* 

LEONARD  KLEINROCK,  RICHARD  R.  MtNTZ,  and  JIINN  HSU** 
Computer  Science  Department 

University  of  California,  Los  Angeles,  California,  U.S.A. 


In  this  paper,  sane  fundamental  properties  are  established  which  apply  to  the  average  response 
time  functions  for  all  time-shared  ccnputer  systems.  The  first  property  is  one  of  mono  tonicity. 

The  second  is  a  conservation  law  which  provides  insight  into  the  trade-offs  available  as  one  varies 
the  response  time  fwcticn  by  changing  the  scheduling  algorithm. 

The  main  thrust  of  the  paper  is  to  establish  tight  upper  and  lower  bounds  on  the  average  response 
time.  All  these  equilibrium  results  are  good  for  Poisson  arrivals,  arbitrary  service  time  distribu¬ 
tion  and  arbitrary  (but  work-conserving)  scheduling  algorithms  which  can  take  advantage  only  of  ar¬ 
rival  time  and  attained  service  time.  Examples  of  these  properties  are  given  for  a  number  of  serv¬ 
ice-time  distributions  and  scheduling  algorithms. 


1.  INTRODUCTION 


We  'are  in  the  midst  of  a  veritable  explosion 
regarding  the  number  of  published  papers  which 
give  analytical  results  far  computer  systems  I 
This  aeons  especially  true  in  the  modeling  and 
analysis  of  time-shared  computer  systems.  1 

It  is  fair  to  say  that  the  recognition  of  prob¬ 
abilistic  models  as  the  appropriate  method  for 
studying  these  systems  was  that  which  permitted 
the  breakthrough  in  analysis.  In  particular,  the 
use  of  queueing  theory  has  been  most  profitable  in 
this  analytic  work. 

As  a  result  of  this  flood  of  results,  each  ap¬ 
plying  to  a  slightly  different  set  of  assumptions, 
it  is  natural  that  we  should  seek  sane  order  in 
this  embarrassment  of  riches.  For  example,  do 
there  exist  any  invariants  in  behavior?  Can  we 
bound  the  possible  range  of  performance,  regard¬ 
less  of  structure?  What  constitutes  feasible  per¬ 
formance  profiles  for  these  systems?  These,  and 
many  more,  are  reasonable  inquiries  to  make  amidst 
the  confusion  of  results. 

In  trtis  paper  we  adept  the  point  of  view  that 
such  questions  are  important  and  nust  be  answered. 
Our  focus  is  on  a  class  of  models  for  time-shared 
oemputer  systems.  For  these  systems  we  axe  able 
to  state  a  mono  tonicity  property,  a  conservation 
law,  and  tight  upper  and  lower  bounds  on  the  sys¬ 
tem  performance  as  measured  by  average  response 
time. 

It  is  worthwhile  mentioning  that  numerous 
papers  have  recently  been  published  which  address 
themselves  to  bounds,  inequalities  and  approximate 
solutions  to  general  queueing  systems.  Among 
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these  are  Marshall  [2,3] ,  Kingman  [4] ,  Iglehart 
[5] ,  Daley  and  Moran  [6] ,  and  Gaver  [7]  to  mention 
a  few. 

2.  THE  OASS  OF  SYSTEMS 

Our  objective  is  to  create  some  order  among 
many  of  the  results  available  in  the  analysis  of 
time-shared  oaiputar  systems.  Let  us  consider  the 
class  of  systems  described  below. 

We  adopt  the  well-known  (8)  feedback  queueing 
model  for  time-shared  systems  shown  in  Fig.  2.1. 


ARRIVALS 


Fig.  2.1.  General  Feedback  Queueing  Msdel 

In  this  model  it  is  assured  that  the  central 
processing  unit  (CPU)  is  the  only  resource  being 
accessed.  Jcbs  arrive  according  to  a  Poisson 
process  with  an  average  arrival  rate  A  jobs/sec. 
They  each  bring  a  demand  for  service  by  the  CPU  in 
an  amount  equal  to  t  seconds,  where  these  de¬ 
mands  are  chosen  independently  from  the  service 
time  distribution  B(t)  » 


B(t)  ■  Plservioe  time  <  t  seconds] 


(2.1) 


Mb  define  the  usual  moments  of  service  time  as 


tf*  -  Bit?*]  -  J  tn<B(t) 


(2.2) 


vnera 


E  denotes  the  expectation  operator. 
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tfe  further  define  the  utilization  factor 


3 


p  ■  It 


(2.3) 


tyctu  Arrival ,  a  job  enters  the  system  of  queues 
where  tie  waits  for  a  "turn"  at  service.  When,  fin¬ 
ally,  his  turn  canes  up,  he  is  provided  a  quantum 
of  service  equal  to  q  seconds.  If  he  requires 
less  than  (or  equal  to)  q  seconds,  he  departs 
upon  canpietion;  if  not,  he  returns  to  the  system 
of  queues  having  been  partially  served,  in  which 
case  we  say  that  he  has  an  attained  servioe  of  q 
seconds.  Eventually,  he  will  be  permitted- a  sec¬ 
ond  quantum,  etc. ,  finally  leaving  when  his  total 
attained  service  equals  his  required  servioe  time, 
we  assume  that  no  overhead  (in  time)  is  incurred 
in  transferring  custaners  in  and  out  of  service 
(i.e.,  no  loss  or  swap-time);  it  is  possible  to 
account  far  swap-time  [9]  in  these  models,  but  we 
do  not  pursue  that  matter  here. 


the  decision  rule  which  chooses  the  next  custo¬ 
mer  to  receive  a  quantum  is  referred  to  as  the 
scheduling  algorithn.  Me  assume  that  the  schedul¬ 
ing  algorithm  makes^use  only  of  X,B(t),  a  job's 
arrival  time  and  a  job's  attained  service. 


In  this  paper,  we  consider  a  very  usefuil  spe¬ 
cial  case  of  the  above  model  in  which  we  permit 
the  quantum  q  to  approach  zero,  this  limit  is 
.known  as  the  processor-sharing  model  [10]  for 
time-shared  system.  In  this  case,  oust  model  in 
Fig.  2.1  becomes  that  of  Fig,  2.2  in  which  mote 
than  one  customer  (say  n)  may  be  sharing  the 
grocessor  simultaneously;  in  juch  a  cuse  each 
customer  receives  servioe  at  a  rats  of  1/n 


T*  of  aervioe/aeoond. 


ARRIVALS 


It  CUSTOMERS 
IN  SERVICE, 

EACH  AT  RATE  l/N 


Fig.  2.2.  Feedback  Queueing  Nodal 
for  Processor  Sharing 

Response  time  is  the  interval  measured  from 
when  a  customer  arrives  denanding  service  until  he 
departs  fully  serviced.  For  a  customer  requiring 
t  seconds  of  servioe,  the  average  response  time 
is  denoted.4 


T(t)  -  average  response  time  for  customer  r  .. 
requiring  t  seconds  of  servioe 


this  quantity  is  usually  taken  as  the  measure  of 


systems  we  consider  are  assumed  to  be  equili¬ 
brium,  which  requires  p  <  1. 


*Since  we  have  p  <  1,  we  consider  steady-state 
results  only,  an  example  of  which  is  T(t) . 


performance  for  time-shared  systems  for  good  rea¬ 
son.  In  particular,  it  is  usually  desired  that 
short  jobs  (small  t)  be  given  preferential  treat¬ 
ment  over  long  jobs;  this  discriminatory  perform¬ 
ance  is  easily  seen  through  the  function  T(  t) . 

A  function  closely  related  to  the  average  re¬ 
sponse  time  T(t) ,  is  the  average  wasted  or  wait¬ 
ing  time  W(t)  defined  as 

W(t)  *  T(t)  -  t  (2.$) 

Furthermore,  we  consider  a  third  related  function, 
W(t)/t  which  may  be  interpreted  as  the  penalty 
rate  to  jobs  requiring  t  seconds  of  service 
since  it  gives  the  ratio  of  the  cost  in  time 
(W(t) )  which  nust  be  paid  per  second  of  useful 
service  time  (t) . 

It  is  convenient  to  introduce  some  additional 
notation  at  this  point.  Let  us  define 

—  m 

tj  mj  tn«B(t)  +  xT  [1  -  B(x)  ]  (2.6) 

0 

i-U 

which  is  the  n  moment  of  the  servioe  time  dis¬ 
tribution  if  servioe  times  are  truncated  at  x 
seconds.  Also  let 


and 


°x  " 


W. 


27T 


(2.7) 


(2.8) 


Note  that  t  -  tn,  p  «  p  and  that  is  the 
expected  work  (backlog)  found  by  a  new  arrival  to 
the  qujeueing  sysbem5  tyQ'l  [11]. 


In  summary  then,  the  class  of  systems  we  con¬ 
sider  is  the  class  of  W/C/l  processor-shared  time¬ 
sharing  systems  with  zero  swap-time  and  arbitrary 
scheduling  algorithms. 


3.  lESPCNSE  TOE  FUNCTIONS 

From  the  published  literature,  we  find  many  re¬ 
sults  for  processor -shared  systems.  Some  of  these 
ws  describe  in  this  section. 


1.  Batch-processing ,  first-ccme-first- 
served  (PCTS) 

In  the  FCFS  system,  the  oldest  jab  in  the 
system  is  given  ocrrplete  use  of  the  CPU 
until  it  oaipletes  its  required  service. 
For  these  systems,  we  haws  [11] 


*4ha  notation  ty'Q/l,  aomron  in  queueing  theory, 
denotes  a  single  server  system  with  Poisson  arri¬ 
val  process  and  arbitrary  service  time  distribu¬ 
tion. 
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w(t)  -  w. 


w  ia--pT  (3,1) 

2.  Itound-Rabin  (RR) 

In  the  RR  system,  all  cus toners  share  the 
CPU  equally.  We  have  [10]®  ' 

» 

W(t)  *  ^  (3.2) 


that  the  penalty  rate  w(t)/t  is  independent  of 
serviae  time,  t. 

Ftor  these  prooessor -shared  systems,  it  is  use¬ 
ful  to  display,  in  one  figure,  the  wasted  time 
w(t) .  this  we  do  in  Fig.  3.1  for  the  case  of  ex¬ 
ponential  serviae  (see  Eq.  (3.3))  with  X  ■  0.75 
and  t  -  1.0  (thus  p  *=  0.75).  We  purposely 
siperiitpoee  the  performance  curves  for  many  sched¬ 
uling  disciplines.  We  are  confronted  with  quite 
a  selection  of  possible  performance  functions! 


3.  Selfish  Rouid-Rabin  (SRR) 

In  the  SRR  system,  all  customers  with-  the 
highest  value  of  "priority"  share  foe  CPU 
equally:  all  others  wait  in  foe  queue. 
Priority  for  a  job  is  calculated  a£  aw  +  6s 
where  a  >  8  >  0  are  constants  and ,  w  is 
the  time  spent  waiting  and  s  is  the  time 
spent  in  foe  CPU  (perhaps  shared)  for  that 
‘  job.  The  SRR  system  has  only  been  solved 
for  exponential  service  time,  i.e. , 

B(t)  -  1  -  e”pt  t  >  0  (3.3) 

In  this  oase  we  liave  [12] 

»<«  -*.♦  i3-« 


4.  Generalized  Foreground-Background  (FB) 

The  FB  system  shares  the  CPU  equally  among 
all  those  jobs  which  have  the  smallest  at¬ 
tained  service.  Fat  the  FB  system  we  have 
[13] 


W(t) 


Wt  +  tpt 

T^pT 


(3.5) 


5.  Multilevel  (Ft,) 

In  the  ML  system,  a  set  of  attained  serv¬ 
iae  times  (a^  is  defined  such  that 


0-ao<«i<a2<  ...  <^<  Vi--  (3<6) 


When  a  job's  attained  service  falls  in  the  i**1 
interval  [a^.a^) ,  then  the  scheduling 

algorithm  followed  for  this  job  is  denoted 

where  D,  mar;  be  FCFS,  RR  or  FB.  The  disci¬ 
pline  followed  between  the  levels  is  FB.  Re¬ 
sults  for  these  1C.  systems  are  reported  in 
(14)  for  arbitrary  B(t)  (with  seme  tional 
restrictions  on  B(t)  when  D.  -  wf  for 
i  >  2) .  1 

Note  that  foe  FCFS  system  offers  no  discrimina¬ 
tion  based  on  attained  serviae  time, -whereas  the 
FB  system  discriminates  as  much  as  possible  on 
this  basis.  The  RR  system  is  "fair"  in  foe  sense 

*It  is  also  true  that  we  obtain  the  identical  W(t) 
for  the  last-come- first-served  system  (LCT8) . 


Fig.  3.1._A  Set  of  Response  Curves  for 
H/M/l,  t  -  1.0,  X  -  0.75,  p  »  0.75. 


One  micht  naturally  inquire  as  to  whether  these 
curves  are  confined  to  any  particular  region  in 
the  (W(t),t)  plane.  The  answer  is  definitely 
yes,'  and  we  develop  these  and  other  constraints 
in  foe  next  section. 

4.  RESULTS 

In  this  section  we  present  results  concerning 
the  response  functions  (W(t) )  which  are  feasible 
when  the  scheduling  discipline  is  based  only  on 
attained  service  times  and  elapsed  waiting  times 
of  jobs.  In  Section  4.1  below  we  describe  several 
fundamental  characteristics  of  W(t)  and,  in  par¬ 
ticular,  we  give  a  conservation  relationship  which 
the  response  function  must  satisfy.  In  Sections 
4.2  and  4.3,  ti^nt  lower  and  ifqper  bounds  are  de¬ 
rived  for  response  functions  in  foe  sense  that  for 
any  W(t) ,  w^t)  <  W(t)  <  wu(t) . 

In  fact,  if  the  reader  looks  at  this  figure  and 
squints  his  eyes,  he  can  almost  guess  foe  shape 
of  such  bounds. 
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4.1.  A  Mono  tonicity  Property  and  a 
Conservation  j 57  for  W(t) 

We  are  considering  scheduling  disciplines  in 
which  each  job  is  characterized  by  (1)  its  attained 
service  time,  tg  and  (2)  its  elapsed  waiting 
time,  t^.  Therefore,  the  state  of  the  system  is 
the  nutrber  of  jobs  in  the  system  and  t_  and  t^ 
for  each  job.  A  particular  scheduling  discipline 
nay  effectively  ignore  one  or  both  of  these  para¬ 
meters,  but  this  information  is  assured  to  be 
available  for  each  job.  Because  scheduling  deci¬ 
sions  are  made  only  on  the  basis  of  these  two  para¬ 
meters,  the  following  statement  is  self-evident. 

The  history  of  a  job  requiring  t^  >  t  seconds  of 
service  from  the  time  of  its  arrival  at  the  system 
until  it  has  received  t  seconds  of  service  is 
independent  of  the  exact  value  of  ti .  A  direct 
consequence  of  this  ./fact  is  that  W(t)  is  a  non- 
decreasing  function  or  equivalently 


W*  (t)  =  2jiiL>  o 


J  <w*(t)  +  1)(T  -  t)dt 


-  (t  -  t)  (W(t)  +  t)  I  +  J  (W(t)  +  t]dt 
0“  0 


-  J  tw(t) 


+  t]dt 


Substituting  into  Eq.  (4.3) 

00  J 

"■*//.  (W(t)  +  t)dt  <33  (t) 

0  0 

Again  changing  the  order  of  integration 


In  deriving  W,(t)  and  Wu(t)  we  shall  need 
another  result  which  is  given  below.  Fran  [8]  we 
have  that 


m  oo 

W-  \J  [W(t)  +  t]f  dB(T)dt 


n(t)  -  Ml  -  B(t)]  JW'(t)  +  1]  (4.2) 

where  n(t)  is  the  density  of  jobs  in  the  system 
with  t  seconds  of  attained  service  time.  We  de¬ 
fine  the  "work”  in  the  system  at  time  t  as  the 
additional  time  required  to  enpty  the  system  if  no 
new  arrivals  are  permitted  entry;  this  is  -also  re¬ 
ferred  to  as  the  "unfinished  work"  and  as  the 
"virtual  waiting  time."  the  mean  work  (7  in  the 
^stn  con  be  eaqpressed  as 


0*  y  n(t)E  [remaining  service  time  for 
•i-  a  job  with  attained  service 

time  of  t]dt 


or  w«/  n(t)/  (T  -  t)  dt 

0“  t 


00 

-  IW(t)  +  t][l  -  B(t)  ]dt 


But  in  general,  we  have  that 


/  til  -  B(t)]dt  -  tj- 


Mxeover,  the  mean  work  in  the  system  is  known 
[11]  to  be 


i(l  -  o) 


Thus  we  have  the  following  conservation  laws  for 
T(t)  and  W(t)« 


Susscituting  from  (4.2) 

\f  (W'(t)  ♦  lij  (T  -  t)cB(i)dt 
O'  0 


2(1-  p)  "  /  T(t)  [1  -  B(t)  ]dt  (4.5) 
0 


By  changing  the  order  of  integration 


(W*(t)  +  1)  (t  -  t)dtjdB(T)  (4.3) 


integrating  the  inner  integral  fay  parts 


2  • 

Hl^  p)  m  f  w(t)  [1  -  B(t)]dt  (4.6) 


We  refer  to  Eqs.  (4.5)  and  (4.6)  as  Conserva¬ 
tion  Laws  since  they  are  based  on  the  conservation 
of  average  unfinished  work  in  the  system.  This 
plaoes  an  integral  constraint  an  W(t)  (and  T(t) ) 
as  a  second  necessary  condition,  regardless  of 
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scheduling  discipline.  The  implies  tione  of  the 
conservation  law  may  be  Been  by  recognizing  that 
[1  -  B(t)  ]  is  a  nonincreasing  function  of  t. 
Thus,  it  one  had  a  given  W(t)  as  a  result  of 
sane  scheduling  algorithms,  and  than  changed  the 
algorithm  so  as  to  reduce  W(t)  over  sore  inter¬ 
val  (0,tQ),  then  the  conservation  law  would  re¬ 
quire  that  the  new  W(t)  be  considerably  above 
the  old  value  for  same  range  above  tg.  This  fol¬ 
lows  since  the  weighting  factor,  1  -  B(t),  is 
smaller  for  larger  t. 

4.2.  The  Lower  Bound 

we  claim  that  to  minimize  W(x)  the  scheduling 
discipline  nust 

1.  never  service  jobs  with  attained  service 
time  greater  than  or  equal  to  x  while 
there  are  jobs  in  the  system  with  attained 
service  time  less  than  x,  and 

2.  never  preempt  a  job  once  it  has  been  se- 
.  1  acted  far  service  until  it  has  at  least 

x  seconds  of  attained  service  time. 

Under  these  conditions  the  response  function  in 
the  interval  (0,x)  is  just  the  response  function 
for  a  nonpreenptive  system  with  service  times 
truncated  at  x.  For  convenience  We  will  assume  a 
FCFS  scheduling  discipline.  In  this  case  the  re¬ 
sponse  function  (denoted  Wp^pg.^t)}  has  the 
form  shown  in  Fig.  4.1  (see,  far  exwple,  (14]). 
Note  that  W^cre_x(t)  -  0  over  (0,x) .  The  sched¬ 
uling  of  jabs  with  attained  service  time  greater 
than  x  is  of  no  concern  in  this  argument  as  long 
as  condition  1  is  maintained. 


Let  \  be  the  mean  work  in  the  system  exclud¬ 
ing  work  to  be  done  on  jobs  beyond  providing  x 
seconds  of  attained  service  to  each.  Xn  other 
words,  if  a  job  requires  t  >  x  seoonds  of  serv¬ 
ice  and  has  received  y  <  x  seoonds  of  serviot, 
its  contribution  to  (L  is  x  -  y.  By  the  sens 
method  used  to  derive  Bq.  (4.5)  it  can  be  shown 
that 

ffx  -  y*  Pf(t)  t]  (1  -  B(t))dt 
0 


Now  since  WrcFS_x(t)  has  minimum  slope  (i.e.,  0) 
ever  the  interval  (0,x) ,  and  due  to  the  mono  ton¬ 
icity  given  in  Bq.  (4.1) ,  if  any  other  response 
curve  W(t)  is  such  that  W(x)  <  wFCFS_x(x)  it 
must  be  such  that  W(t)  <  WrcFS_x  ( t)  for 
0  <  t  <  x.  But  under  condition  1  above,  Wx  has 
its  miniitun  value  since  work  in  this  class  is  con¬ 
tinuously  decreased  at  maximum  rate  whenever  there 
is  such  work  in  the  system.  Therefore,  for  any 
W(t), 

rx 

X/  (W(t)  ♦  t]  (1  -  B(t)dt 

0 

rx 

-  7  ^OTS-X^  +  t)  l1  -  »(t)]«t 

0 

Thus  we  conclude  that  W(t)  <  Wfcps-xCt)  in  (0,x) 
is  inpossible  and  therefore  W(x)  >  Wn7S_x(x) . 

The  lower  bound  Wt(t)  is  given  by  the  waiting 
time  for  the  FCFS  discipline  with  the  service 
times  truncated  at  t,  namely  [14] 

% 

V«  -  irr^T  <‘-7> 

Note  that  WjjJO)  ■  0  and  that  W^(-)  -  W;  also 

w£(0)  -  w£(-)  -  0. 

4.3.  The  Upper  Bound 

In  this  case  we  begin  with  a  discrete  time  sys¬ 
tem. 

Assure  that  the  service  time  distribution  is  of 
the  form 


Pr (service  time  ■  kq)  »  k  ■  1,2,3,... 

where  q  is  the  quantizn  as  discussed  in  Section  2. 
Therefore,  the  only  possible  service  time  require¬ 
ments  are  multiples  of  q.  We  shall  also  assume 
that  arrivals  may  take  place  only  during  the  in¬ 
stant  before  the  end  of  a  quantum  and  that  the 
processor  is  assigned  to  a  j dj  for  a  quantum  at  a 
time.  The  probability  that  an  arrival  takes  place 
at  the  end  of  a  quantum  is  Xq  so  that  the  mean 
arrival  rate  is  X.  It  should  be  clear  that  any 
continuous  service  time  distribution  can  be  ap¬ 
proximated  arbitrarily  closely  by  a  discrete  time 
distribution  by  letting  q  approach  0.  Also, 
these  restrictions  on  the  service  discipline  and 
arrival  mechanism  are  effectively  eliminated  when 
q  0.  In  this  discrete  time  model  our  goal  is  to 
maximize  H(kq) . 

He  claim  that  the  following  scheduling  rule  is 
necessary  and  sufficient  to  maximize  W(kq) :  no 
allocation  of  a  k^i  quantum  is  made  to  ary  job 
when  there  is  sane  other  jab  in  the  system  waiting 
for  its  j™  quantum  where  j  +  k.  He  note  in  pass- 
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We  relabel  the  tine  axis  so  that  t  *  0  at  an 
arbitrary  point  in  sore  idle  period.  The  tines  at 
which  sane  jab  is  allocated  a  Xth  quantum  we  call 
"critical  times."  Let  c^  be  the  tine  that  the 
1th  critical  time  occurs.  We  wish  to  maximize 
C£  (the  average  of  Cjj  for  sane  fixed  l,  and 
we  will  show  that  to  accomplish  this  it  is  neces¬ 
sary  and  sufficient  to  satisfy  the  condition  that 
at  the  1th  critical  time  no  job  is  waiting  for  a 
j™  quantum  where  j  k.  Certainly  this  condi¬ 
tion  is  necessary  since  if  a  proposed  scheduling 
discipline  did  not  have  this  property  then  c^ 
can  easily  be  increased  when  the  condition  is  not 
satisfied  as  follows:  follow  the  proposed  sched¬ 
ule  until  the  point  where  the  1th  critical  time 
would  occur  and  then  assign  a  quantum  to  a  job 
waiting  for  its  j“*  k)  quantum. 

Since  we  have  already  shown  necessity,  to  prove 
tin  sufficiency  of  the  condition  for  maximizing 
cv  we  need  only  show  that  any  schedule  satisfy¬ 
ing  the  condition  yields  the  same  value  for  c£. 
Let  A  be  any  scheduling  algorithm  which  satis¬ 
fies  the  rule  that  at  the  £“*  critical  time  no 
job  is  waiting  far  a  j™  quantum  where  j  +  k. 

Let  be  the  time  at  which  the  IIth  job  arrives 
which  will  require  at  least  kq  seconds  of  serv¬ 
ice.  The  state  of  the  system  at  a(  will,  in 
general,  depend  on  the  algorithm  A.  In  particu¬ 
lar,  the  number  of  critical  times  that  have  oc¬ 
curred  prior  to  (let  this  be  s)  is  a  func¬ 
tion  of  A.  Let  E&[C£  -  a j | state  of  syBtxm  at 
at]  be  the  expected, value  of  c^  -  a£  under  algo¬ 
rithm  A  conditioned  on  the  state  of  the  system 
at  ag.  The  state  of  the  systan  is  given  fay  the 
number  of  jebs  in  the  system,  the  attained  service 
time  of  each  job  in  the  system  and  s,  the  number 
of  critical  times  that  have  occurred.  Thus,  we 
have 

E^(cji  -  a^l  state  of  systems  at  a^] 

■  E.  [remaining  work  in  system  not 

A  requiring  a  k™  quantum  (state 
of  system  at  a^J 

+  (i  -  e  —  1)E [remaining  service  time  for 
job  with  (k  -  l)q  eeoonds 
of  attained  service] 

+  (k  -  l)q 

+  lt(jt_1)qEA[cJl  -  a^  |  state  of  the  system 

at  a^l  (4.8) 

But  the  sun  of  the  first  two  terns  on  the  ricfit- 
hand  side  of  this  equation  is  equal  to  the  ex¬ 
pected  amount  of  work  in  the  system  at  a,  given 
the  state  at  a^.  Thus 

EAlcf  -  a^  |  state  of  syetan  at  a^J 

■  EA(wack  in  system  at  aj  state  at  a^] 

♦  <k  -  l)q 

♦  ^(k-Dq^A^i  “  of  systan  at  a^] 


Removing  the  oondition  on  the  state  of  the  systan 
at  a^  we  have 

EA[ct  -  a^]  »  EA[work  in  the  system  at  a^] 


+  (k  -  l)q  +  A tjjjc-u  q^Ic*.  “  V 


*  EAtcl  " 


EA[wark  in  systan  at  a.,]  +  (k-l)q 
1  "  Xt(k-l)q 


But  EA[vrark  in  systems  at  a^  j  is  not  a  function 
of  the  particular  scheduling  algorithm  and  there¬ 
fore  E^Ic^  -  a£]  does  not  depend  on  A.  Since 

EfcjJ  =  Efc^  -  a^]  +  Eta^]  and  the  right-hand 
side  is  independent  of  A,  E[ct]  is  independent 
of  A.  Note  that  the  form  of  Eq.  (4.8)  depended 
on  A  having  the  property  that  at  there  are 

no  jobs  in  the  systan  waiting  for  a  j**1  quantum 
where  j  j*  k-  We  have  now  shewn  that  this  condi¬ 
tion  is  necessary  and  sufficient  to  maximize 
E[ct)(-  ct) . 

We  now  show  teat  the  general  scheduling  rule  to 
maximize  W(kq)  is  the  Fame  rule  which  maximizes 
c  applied  for  all  £.  We  have 


£  cz  -  2  a^ 

W(Joq)  -  lim  — - — ^ —  (4.9) 

n*» 


The  are  independent  of  the  scheduling  disci¬ 
pline  and  the  proposed  scheduling  rule  is  neces¬ 
sary  and  sufficient  to  individually  maximize  the 
c g.  therefore,  the  same  rule  is  necessary  and  suf¬ 
ficient  to  maximize  W(kq) ,  which  establishes  our 
earlier  claim. 

It  should  be  clear  that  in  a  continuous  time 
system  wb  can  approach  the  maximum  of  W(x)  by 
the  following  rule:  no  job  with  attained  service 
time  in  the  open  interval  (x  -  e,x)  (for  c  >  0) 
is  serviced  while  there  is  a  job  waiting  for  serv-  , 
ice  which  has  attained  service  time  outside  this 
interval.  By  permitting  e  to  shrink  to  zero,  we 
approach  tee  maximum  far  W(x) . 

One  scheduling  discipline  which  maximizes  W(x) 
is  the  two-level  systan  in  which  jobs  are  served 
PCFS  in  tee  first  level  up  to  x”  seconds  of  at¬ 
tained  service.  A  job  which  does  not  finish  is 
placed  in  the  second  level  queue.  Die  second 
queue  is  serviced  PCFS  to  ccnpletion.  Die  second 
queue  has  a  lower  priority  and  is  only  serviced 
when  the  first  queue  is  enpty  (see  the  ML  systems 
desaribed  in  Section  3) .  Dlls  queueing  system 
satisfies  tee  oondition  for  maximizing  w(x)  and 
therefore  from  [14]  we  have 

*u(t> "  w~=  -  p) +  r=7;  (4-10) 
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Note  that  Wu<0)  -  W  -  Wt(«) , .  that  W£(0)  -  0, 
and' that  W^(»)  -  p/(l  -  p). 

4.4.  Sunrary 

tn  Hi  t «*  n»»*Hnn  w*>  li.ive  p«(  ahlislwl  Uie  follcw- 
mv  a*.y  toHwantlutvi 


T*  it)  >1 

(4.11) 

W'(t)  >  0 

(4.12) 

m 

7 

f  T(t)(l 

• »“»)“- iir^r 

(4.13) 

0 

m 

ri. 

f  W(t)  [1 

(4.14) 

0 


(4.15) 


5.  EXAM’IZS 

Four  exanples  are  given  in  this  section  to 
demonstrate  the  nature  of  the  tight  bounds  we  have 
obtained.  As  a  performance  measure,  the  equili¬ 
brium  average  waiting  times,  W(t) ,  are  plotted 
as  a  function  of  t.  Vie  begin  with  the  IVV1  sys¬ 
tem  (i.e. ,  Poisson  arrivals  and  exponential  serv¬ 
ice)  .  The  response  functions  of  Fig.  3.1  are 
given  again  in  Fig.  5.1  with  the  ipper  and  lower 
bounds  super ijTposed.  At  t  -  0,  the  ipper  bound 
and  FCFS  start  at  the  same  point  because,  under 
the  constraint  of  the  conservation  law,  no  other 
scheduling  algorithm  can  give  longer  average  wait¬ 
ing  time  at  t  »  0  than  FCFS.  The  upper  bound 
approaches  the  FB  response  asymptotically  as  t 
approaches  infinity;  therefore,  a  customer  with  a 
very  long  requested  service  time  (as  compared  to 
the  mean)  cannot  be  delayed  much  more  than  he  is 
with  FB.  The  lower  boind  starts  at  aero  (as  does 
the  FB  curve) ,  increasing  less  rapidly  with  t 
than  the  vpper  bound.  It  approaches  the  FCFS  curve 
asymptotically  as  t  goes  to  infinity.  Thus  we 
note  that  the  least  discriminating  scheduling  algo* 
rithn  (FCFS)  touches  the  vpper  bound  at  t  ■  0  and 
forms  the  asymptote  for  the  lcwer  bound  as  t 
approaches  infinity;  conversely,  the  most  discrimi¬ 
nating  scheduling  algorithm  (FB)  touches  the  lower  . 
bound  at  t  =  0  and  forms  the  asymptote  for  the 
upper  bound  as  t  approaches  infinity.  The  above- 
mentioned  behavior  of  the  upper  and  lower  bounds 
applies  not  only  for  the  fVM/1  system,  but  also 
holds  true  for  any  M/G i/1  system  in  general,  al¬ 
though  the  rate  of  convergence  for  the  bounds  to 
their  respective  limits  varies  for  different  serv¬ 
ice  distributions. 


Fig.  5.1.  Bounds  on  Response  far  HW1, 
t-  1.0,  \  m  c.75,  p  -  0.75. 


for  the  second  example  we  chooee  the  system 
H/Eyl.  In  this  system  we  have 

-  (2p)2xe-2ux  x  >  0  (5.1) 


with  mean  service  time  equal  to  1/us  the  second 

moment  of  this  distribution  is  3/2v?-  Because 
the  second  moment  is  smaller  than  that  of  the  ex- 

ponential  distribution  (whose  value  is  2/u  ) , 
the  bounds  aze  tighter  in  this  exanple  than  the 
case,  just  as  one  would  expect.  Fig.  5.2 
shows  the  behavior  of  this  system  with  p  ■  1 
and  X  -  0.75.  It  is  obvious  from  the  figure  that 
for  t  >  5/u,  the  upper  and  lower  bounds  have  es¬ 
sentially  reached  their  asymptotic  form. 

In  the  third  example  we  show  foe  bounds  for  foe 
IVH^l  *y»ten,  where  Hj  stands  for  hyperexponen- 
tial  service  distribution  with 

dB  lx)  — UjJC  "WjH 

■  0. 5)^6  *  +  0.5w2e  x  >  0  (5.2) 


We  choose  ■  5p,  Vj  ■  (5/9)u,  resulting  in  a 

mean  aervioe  time  of  1/p.  The  second  moment  of 

this  distribution  is  82/25 u2.  Fig.  5.3  shows  the 
behavior  of  foe  M/H^/l  system  with  p  -  1  and 

X  m  0.75.  The  upper  and  lower  bounds  approach 
their  respective  limits  et  a  slower  rate  than 


i 
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and  X  -  0.1875,  t  -  4.0,  p  -  0.75.  Fig.  5.4 
shows  the  behavior  of  this  system.  Notioe  that 
when  t  >  6,  the  ®per  bound  coincides  exactly 
with  the  FB  curve  and  that  the  lower  bound  coin¬ 
cides  exactly  with  the  FCFS  curve.  The  probabil 
tty  of  having  any  customer  requesting  more  than 
six  seconds  of  servioe  in  this  example  is,  of 
oourse,  equal  to  aero. 

UMC*90UN0-\ 

-r 

Fig.  5.2.  Bounds  on  Response  for  M/Ej/l 
t  -  1.0,  X  -  0.75,  p  -  0.75. 


Fig.  5.4.  Bounds  on  Response  for  M/V/l, 
t-  4.0,  X  -  0.1875,  p  -  0.75. 

Another  performance  measure,  W(t)/t,  is  given 
in  Fig.  5.5  for  the  IVH^l  case  and  is  of  interest 
to  us,  since  (as  mentioned  in  Section  2)  it  gives 
some  feeling  for  how  large  a  price  (in  terra  of 
wasted  time)  a  customer  must  pay  in  order  to  get 
a  unit  of  service  time.  Ftor  the  case  of  RR,  this 
measure  is  a  oonotanti  thus  each  customer  has  the 


Fig.  5.3.  Bounds  on  Response  for 

t  -  1.0,  X  -  0.75,  p  -  0.75.  * 


either  M/M/ 1  or  M/t^l  because  of  the  larger  sec¬ 
ond  moment.  * 

For  our  last  example  we  choose  the  system  M/tyl 
where  U  stands  far  uniform  service  distribution. 
Far  this  particular  example  we  have 


[0.25  2<x<6 


otherwise 


Fig.  5.5^  Bounds  on  Penalty  Rate  for  Htyl 
t  ■  1.0,  X  -  0.75,  p  ■  0.75. 


sane  penalty  rate,  regardless  of  his  service  time* 

In  this  senso,  everyone  is  treated  equally  in  the' 
l»f>  r./sUm.  The  curve  reifresentiny  PCFS  Is  nonoton- 
i  •  m  1 1  y  <V.<  ,on.qiiK^  wi  Hi  I,  o,k1  wi  i  lm  tfn>r>r  j'Jm 
l  <a'f  ul  n  aim  it  mi  l*ninlly  inln)  III  lliln  rnhn,  n'/nluil 
uaeis  might  atUai|jt  lo  "pool"  Uieir  raquuuts  to 
take  advantage  of  this  “quantity  discount."  An¬ 
other  extreme  example  is  provided  by  5B;  W(t)/t 
increases  rapidly  when  t  is  snail,  then  drops 
slowly  to  a  constant  (p/(  1-p)) .  A  customer  with 
a  long  request  can  do  better  by  breaking  his  jeto 
into  smaller  independent  jobs  and  submitting  them 
separately  to  the  system  (if  this  is  possible)  be¬ 
cause  then  the  penalty  rate  will  be  greatly  reduced. 

Fig.  5.6  shows  the  range  of  the  bounds  for  the 
M/M/  1  system  with  p  -  0.75,  0.5  and  0.25,  respec¬ 
tively.  As  can  be  seen,  the  region  included  be¬ 
tween  the  upper  and  lower  bounds  for  a  particular 
utilization  factor  p  depends  heavily  on  pi  the 
larger  the  value  of  p,  the  greater  is  the  verti¬ 
cal  separation  between  the  two  bounds,  thus  allow¬ 
ing  larger  variation  of  the  mean  waiting  times  for 
different  scheduling  algorithms. 


Fig.  5.6.  Variation  of  BoundB  for  M/M/1 
with  p  -  0.25,  0.50,  0.75. 


6.  EXTENSIONS 

As  we  implied  in  the  Introduction,  we  have  ans¬ 
wered  some  fundamental  questions  regarding  the 
existence  of  order  and  structure  in  the  analytical 
results  for  time-shared  oaiputer  systems.  Our 
principal  results  are  given  as  a  mono  tonicity  con¬ 
dition  (Eqs.  (4.11,4.12)),  a  conservation  law  (Eqs. 
(4.13,4.14))  and  tight  upper  and  lower  bounds  (Eq. 
(4.15))  on  the  response  function  W(t) .  These  re¬ 
sults  are  exerplified  by  the  curves  given  in  Sec¬ 
tion  5.  We  note  here  that  although  the  results 
were  expressed  for  processor-shared  system,  the 
sane  type  of  results  apply  to  the  case  q  >  0. 

We  nd^nt  observe  seme  additions!  properties 


which  follow  from  our  results.  First  we  see  that 
any  Wft)  may  touch  the  lower  bound  at  mo6t  once 
(except;  over  the  semi-infinite  interval  <  t 

wU'ii  11(1^)  -  1) ;  the  same  may  be  said  for  the 
ippor  bound. 

Secondly,  we  find  that  we  are  able  to  respond 
to  the  following  kind  of  specification.  Suppose 
that  a  designer  requests  that  all  jobs  of  duration 
tit*  should  liave  an  average  wasted  time 
W(t)  <  W*.  Then  if  W*  >  W4(t*),  it  is  possible 
to  guarantee  at  least  this  behavior  (for  example, 
by  an  ML  system  where  the  first  level  is  PCFS  out 
to  t*) .  Such  a  specification  seems  to  us  to  be 
quite  natural.  The  next  obvious  need  is  to  specify 
the  bonds  on  W(t)  which  exist  for  t  >  t*. 

Lastly,  we  poee  the  more  general  question  which, 
at  the  time  of  this  writing  remains  unsolved, 
namely,  what  are  the  necessary  and  sufficient  con¬ 
ditions  for  a  given  response  function  to  be  feas¬ 
ible?  This  paper  has  presented  some  inpartant 
necessary  conditions. 
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ABSTRACT 

A  study  is  made  of  routing  techniques  applicable  to 
store-and-forward  ocnputer  networks  (e.g.,  the  A»A 
Network)  in  order  to  show  their  inportanoe  in  relation 
to  the  theoretical  design  of  these  networks  and  to  the 
performance  of  existing  networks.  The  major  attempt  has 
been  to  classify  routing  techniques  and  to  specify  their 
parameters  as  vril  as  a  means  of  evaluating  their  per¬ 
formance.  Using  average  message,  delay  as  a  measure  of 
network  performance,  a  nurber  of  routing  techniques  are 
oompared  via  theoretical  and  cccputer  simulation  results. 

i.  nmgggrKK 

This  paper  considers  message  flow  in  a  specific  class 
of  networks  denoted  as  store-and-forward  computer-com¬ 
munication  nets.  Such  nets  accept  message  traffic  from 
external  sources  (ccmputers)  and  transnit  this  traffic 
over  sane  route  within  the  network  to  the  destination; 
this  transmission  takes  place  over  one  link  at  a  time, 
with  possible  storage  of  the  message  at  each  intermedi¬ 
ate  switching  node  due  to  congestion.  One  of  the  funda¬ 
mental  problems  in  these  nets  is  the  routing  of  messages 
in  an  orderly  manner  to  insure  their  rapid  delivery. 

The  requirements  for  such  a  systas  differ  considerably 
firm  those  of  the  telephone  system  enploying  circuit  or 
line  switching  and  from  those  of  military  acrmunicatiai 
networks  required  to  operate  in  extronely  hostile  en¬ 
vironments. 

The  study  cf  routing  techniques  is  important  because 
of  the  central  role  they  play  in  the  design  and  opera¬ 
tion  of  low  cost  oarputar-onmiwication  nets.  The  ab¬ 
stract  design  of  a  low  cost  acnputer-ooranunicaticn  net¬ 
work  was  first  stated  by  Kleinrock3-1  as  follows; 

minimize  T  (the  average  message  delay) 

over  the  link  capacity  assignment  ) 

design  message  priority  discipline  ( 

variables  routing  doctrine  I  (1) 

,  topology  * 

subject  to 

a  suitable  ooet  criterion  and  external 
traffic  requirement 

All  of  the  design  variables  are  interdependent  and  a 
general  solution  technique  is  unknown,  although  signifi¬ 
cant  jgs  been  made  for  sane  interesting  special 

Before  the  general  solution  of  Eq.  (1)  can  be  under¬ 
taken,  it  is  important  to  determine  how  the  variation  of 
the  design  parameters  in  this  equation  influences  the 
average  message  delay  T.  Here  we  address  the  routing 
doctrine  question.  Key  areas  which  require  study  are: 
what  should  a  routing  technique  achieve;  how  can  routing 
techniques  be  classified;  how  are  routing  algorithms 
specified;  what  are  the  appropriate  performance  measures 
and;  how  are  routing  algorithms  evaluated?  Below,  we 


•Bus  work  was  supported  by  the  Advanced  Research 
Projects  Agency  of  the  Department  of  Defens*  (DAHC-15- 
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attsrpt  to  answer  these  questions  in  relation  to  the 
selected  ocrputer-aornunicaticn  network  model. 

H.  THE  OOMFOTER-OOEMJNIOgTOJ  NET 

In  order  to  properly  characterize  that  an  adaptive 
routing  technique  (algorithm)  should  achieve,  the  uni¬ 
verse  in  which  it  operates  must  first  be  specified. 

This  requires  a  characterization  for  conputer-ccnTOnica- 
tion  networks . 

The  class  of  networks  considered  in  this  paper  can  be 
depicted  as  shown  in  Figures  1*  and  2  and  are  modeled 


Figure  1.  Nutwoffc  Topology 

The  characteristics  of  the  network  model  are: 

1.  Each  pair  of  nodes  (N^,Nj)  an  be  connected 
by  at  meet  one  dedicated  high-quality  (low 
error  rate)  full  dtplex  digital  ccnrunicatian 
line. 

2.  Each  caimunicaticn  link  has  fixed  capacity. 

3.  Each  node  has  finite  storage  and  operates  in 
a  store-and-forwerd  fashicn. 

4.  Satellites  are  not  utilized  as  nodes.6 

The  basic  wit  of  information  passed  between  any  pair 
of  nodes  is  called  a  "packet"  with  maxiimm  size  of  ap¬ 
proximately  1000  bite.  When  a  packet  is  received  at  a 
nods,  it  is  stored  and  checked  for  errors  via  an  error 
detecting  code.  If  correct  and  if  this  node  is  willing 
to  "accept"  the  packet,  then  a  positive  acknowledgment 
is  sent  back  to  the  preceding  node  indicating  this  fact; 
otherwise,  a  negative  acknowledgment  is  sent  back  (nega¬ 
tive  acknowledgments,  however,  are  not  used  in  the  AHPA 
network) .  When  a  node  receives  a.  positive  acknowledg¬ 
ment,  it  destroys  its  aopy  of  the  packet;  otherwise  the 
padeet  is  retransmitted.  If  a  packet  is  not  destined 
for  the  node  at  which  it  was  received,  it  is  relayed 
(routed)  further  along  its  path  to  a  neighboring  node. 


"The  AK>A  network  topology  has  since  changed  signifi¬ 
cantly.  However,  we  continue  to  utilize  it  in  order  to 
oonpare  our  current  simulation  and  theoretical  results 
with  those  contained  in  Rafs.  12  and  13. 


The  routing  procedure  determines  the  path  a  packet  tra¬ 
verses  from  a  source  node  ^  to  a  destination  node  Hp. 

For  exanple,  the  paths  *  (5,6,16,8)  and  *2  m  (5,4,17, 
8)  are  two  of  the  many  possible  paths  from  Ng  -  5  to 
N0  •  8  as  shown  in  Figure  1. 

The  assisted  internal  structure  of  a  node,  shown  in 
Figure  2,  consists  of  a  store-and-forward  switch  re¬ 
ferred  to  as  an  IM»  (Interfaoe  Message  Processor)  and  a 
HOST  (external  caiputer  system) .  The  function  of  the 
09  is  to  allocate  storage  for  incoming  packets,  perform 


B  -  SWITCH  WHICH  CAN  BLOCK  TRAFFIC 
FLOW  TO  THE  IMF 

CPU  -  CENTRAL  PROCESSING  UNIT  ROUTINE 
a  -  QUEUE 

S  -  SERVER  (REPRESENTS  THE  FINITE  RATE  OF  TRANS¬ 
MISSION  ON  THE  OUTPUT  LINKS ) 

F%u»2.  WilMilMMn 


2.  The  routing  tednique  should  adapt  to  changes 
in  the  network  topology  resulting  fztxn  node 
and  aamuiicaticn  link  failures. 

3.  The  routing  technique  should  adapt  to  varying 
source-destination  traffic  loads. 

4.  Packets  should  be  routed  around  nodes  that  are 
congested  or  temporarily  blocked  due  to  a  full 
storage. 


in.  CLASS  IFICKITCN  OF  ROUITOG  TECHNIQUES 

It  is  desirable  to  classify  network  routing  tech¬ 
niques  in  order  to  gain  insight  into  their  structure, 
oceplexity  and  performance;  from  this,  one  may  then  com¬ 
pare  them  as  candidates  for  operational  nettork  algo¬ 
rithms.  The  two  major  classifications  selected  are 
(1)  deterministic,  and  (2)  stochastic  techniques.  Deter¬ 
ministic  routing  techniques  ccnpute  routes  based  upon  a 
given  deterministic  decani cn  rule  and  produce  a  loop- 
free  routing  procedure  (i.e. ,  packets  cannot  become 
trapped  in  closed  paths) .  Stochastic  techniques,  on  the 
other  hand,  operate  as  probabilistic  decision  rules,  uti¬ 
lizing  topology  and  either  no  information  about  the 
statu  of  the  network  (random  routing)  or  estimates  of 
the  present  state  of  the  network.  With,  these  techniques, 
packets  may  be  trapped  in  loops  for  short  time  periods. 
Figure  3  shows  a  more  cosplete  classification  of  the  ap¬ 
plicable  routing  techniques. 
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routing  for  packets  which  must  be  relayed,  acknowledge 
accepted  packets  and  perform  other  routine  functions 
(i.e.,  packet  error  checking,  circuit  fault  detection, 
traffic  maasurenant,  etc.) .  In  addition,  the  CPU  rou¬ 
tine  can  block  incoming  messages  from  its  HOST  when  suf¬ 
ficient  storage  is  unavailable. 

Messages,  which  originate  at  a  HOST,  have  a  maximum 
length  of  approximately  8000  bits.  The  IMP  segments  a 
HOST'S  message  into  packets  (i.e.,  as  marry  maximum  sized 
packets  as  necessary,  plus  a  "remainder"  packet) .  These 
packets  are.  then  handled  by  the  network  as  independent 
entities  until  they  reach  their  destination  node.  There 
the  packets  of  a  message  are-  collected  and  the  message 
is  reassembled  before  it  is  transferred  to  the  destina¬ 
tion  HOST.  Messages  which  consist  of  only  a.  single 
packet  are  given  higher  priority  than  rrulti  -packet  mes¬ 
sages  so  that  the  network  can  support  interactive  users. 

Using  this  network  model,  the  message  routing  require¬ 
ments  for  the  ooKputer-ooramnicatitn  network  can  be  sim¬ 
ply  stated: 

1.  Massage  routing  should  insure  rapid  and  error- 
free  delivery  of  messages. 
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Deterministic  Techniques 

Urn  four  basic  deterministic  techniques  are: 

1.  Flooding.  Each  node  receiving  or  originating  a 
packet  transmits  a  aopy  of  it  over  "all”  outgoing  links 
or  over  a  set  of  "selective"  outgoing  links;  this  trans¬ 
mission  occurs  cnly  after  the  node  has  checked  to  see 
that  it  has  not  previously  transmitted  the  packet,  or 
that  it  is  not  the  destination  of  the  packet.  This  tech¬ 
nique  has  been  discussed  by  Boehm  and  Hsbley.3  Their  • 
conclusion  is  that  the  inefficiency  of  this  technique  is 
tolerable  if  one  has  only  a  few  messages  to  deliver. 
However,  a  large  volume  of  ocmnunicaticns  traffic  neces¬ 
sitates  more  efficient  routing  techniques.  Another  draw¬ 
back  to  this  technique  is  that  each  node  requires  a  mech¬ 
anist)  to  raoognia  previously  transmitted  messages. 

2.  Fixed  touting.  Fixed  routing  algorithms  specify  a 
unique  path  ir(Ng,  ...,  Np)  (route)  followed  by  a  packet 
which  depends  cnly  upon  roe  source-destination  node  pair 
(1%,Nq)  .  TO  aooonplish  this,  each  node  has  a  routing 
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table  similar  to  that  shown  in  Figure  4. 

If  a  packet  must  be  relayed,  its  destina¬ 
tion  is  used  to  enter  the  routing  table. 
The  entry  contained  in  the  routing  table 
specifies  the  next  unique  node  in  the 
packet's  oath.  Kleinrock12'13  and 
Prosser^  have  examined  several  of  these 
techniques.  Fixed  routing  techniques  re¬ 
quire  completely  reliable  nodes  and  links , 
except  for  the  occasional  retransmission 
of  a  packet  due  to  channel  bit  errors. 
However,  they  do  allow  for  highly  effi¬ 
cient  high  volume  traffic  flow  and  are 
very  stable. 

3.  Network  Routing  Control  Center 
(NRQCjT  With  this  technique,  one  of  the 
network  nodes  is  designated  as  the  NROC. 
This  center  collects  performance  informa¬ 
tion  about  the  network  operation,  confutes 
routing  tables  and  then  transmits  the  ap¬ 
propriate  routing  table  to  each  node  in 
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the  network.  amputation  of  the  routes 
by  the  NCKR  is  done  on  a  global  basis  and  this  insures 
loop-free  paths  between  all  source-destination  node 
pairs.  Thus,  a  fixed  routing  procedure  is  maintained 
between  NROC  updates. 


lhere  are  a  number  of  drawbacks  to  this  techniqua. 
By  the  time  the  nodes  begin  using  the  new  routing  tables, 
the  performance  information  that  was  us6d  in  the  computa¬ 
tion  of  the  routing  tables  may  be  out  of  date  in  rela¬ 
tion  to  the  current  state  of  the  network.  In  addition, 
transmission  costs  and  vulnerability  beooua  significant 
considerations . 


4.  Ideal  Observer  Routing.  This  technique  is  essen¬ 
tially  a  scheduling  problem.  Each  time  a  new  packet 
enters  a  node  from  the  HOST,  its  route  is  computed  to 
minimize  its  travel  time  to  its  destination  node,  based 
ipon  the  complete  present  information  about  the  packets 
already  in  the  network  and  their  known  routes.  If  the 
ideal  observer  has  information  about  the  occurrence  of 
future  events,  then  this  information  oould  also  be  uti¬ 
lized  in  the  computation  of  the  route.  This  technique 
is  obviously  impractical  for  an  operational  network,  but 
from  a  theoretical  viewpoint,  provides  the  minimum  aver¬ 
age  message  delay  to  which  all  other  routing  techniques 
•may  be  ampazed. 


table  is  then  formed  by  choosing,  for  each  row  (say  the 
i^1  row),  that  output  line  nutber  OI^(i)  whose  value  in 
the  delay  table  is  miniigun  as  follows : 

a^,(i)  -  rain  Tj(i»I^)  (2) 

where  {L^}  is  the  set  of  output  line  mmbezs  for  node  J. 
figure  5  shows  an  exanple. 
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The  manner  in  which  the  estimates  Tj(.,  .)  are  formed 

end  updated  and  how  often  the  delay  tables  are  interro¬ 
gated  depends  upon  the  specific  structure  of  the  routing 
algorithm. 

in  the  shortest  queue  »  zero  bias  algorithm,  a  packet's 
route  is  selected  by  placing  it  in  the  shortest  output 
channel  queue.  1his  is  essentially  Baran's  Hot  Potato 
routing  concept1'2.  Since  the  route  selected  is  inde- 
pendent  of  the  packet's  destination,  the  delay  table 
would  require  only  one  row,  where  the  row  entries  would 
reflect  the  output  channel  queue  lengths.  The  non-zero 
bias  case  will  be  discussed  later  as  a  limiting  case  of 
a  distributed  routing  technique. 


Stochastic  Techniques 

The  three  basic  stochastic  techniques  are: 

1.  Randan  Routing.  Randan  routing  procedures  are 
those  decision  rules  in  which  the  choioe  as  to  the  next 
node  to  visit  is  "made  according  to  same  probability  dis¬ 
tribution  over  the  set  of  neighbor  nodes.  The  set  of 
neighbor  nodes  utilized  in  the  decision  rule  can  be  "all* 
of  the  connected  nodes  or  can  be  based  "selectively" 
over  that  set  of  nodes  which  are  in  the  general  direc¬ 
tion  of  the  packet's  destination. 

Kleinrock11  and  Prosser14  have  investigated  ntmer- 
ous  randan  routing  techniques  and  have  shown  that  they 
are  highly  inefficient  in  terms  of  message  delay,  but 
are  extremely  stable  (i.e.,  they  are  relatively  unaf¬ 
fected  by  small  changes  in  the  network  structure) . 

2.  Isolated  and  3.  Distributed  Techniques.  All  of  the 
isolated  ana  distributed  routing  algoriums*  operate  in 
basically  the  -sane  manner.  A  delay  table  is  formed  at 
each  node  as  shown  in  Figure  5"  The'  entries  Tj(D,I^) 
are  the  estimated  delays  to  go  from  the  node  under  con¬ 
sideration  (say  node  J)  to  same  destination  node  0  using 
line  Ity  as  the  next  step  in  the  path  to  0.  A  routing 


In  the  local  delay  estimate  algorithm,  a  packet's 
route  is  selected  via  Bq.  (2) .  The  delay  table  is  up¬ 
dated  after  a  packet  is  received  (say  at  node  J)  by  the 
following  scheme 

fy*6  V^new  *  Vfy^'V^old  +  VT™(S'J>  (3) 

where 

TIN(S,J)  -  the  Time  the  packet  has  spent  In  the 
Network  traveling  from  its  source  node 
S  to  the  current  node  J, 

L  ^  ■  the  reverse  (outgoing)  line  oonespond- 
N  lag  to  the  forward  (incoming)  line  1^ 
of  the  full-duplex  pair  upon  which  the 
packet  entered  node  J, 

and 

and  Kj  are  constants. 

This  technique,  called  backwards  learning,  has  been  ex¬ 
tensively  investigated  by  Baran1!  Boehm  and  Mobley3 
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(4) 


offer  modifications  to  the  basic  technique  ©3*  (3) )  to 
inprove  its  performance. 

In  the  distributed  routing  techniques  classification, 
all  routing  algorithms  utilize  the  sane  basic  techniques 
to  aonpute  and  tpdate  the  delay  table  estimates,  but  the 
instants  at  which  these  tables  are  updated  and  the  route 
selection  procedure  differs  depending  upon  the  particu¬ 
lar  structure  of  the  algorithm. 

There  are  basically  two  different  mechanisms  which 
cause  entries  in  the  delay  tables  to  change:  (1)  as 
packets  are  placed  an  or  taken  off)  an  output  channel 
queue,  all  delay  table  entries  in  a  colum  correspond¬ 
ing  to  that  output  line  must  be  increased  (or  decreased) 
to  reflect  the  change  in  expected  delay  for  the  channel; 
and  (2)  when  delay  information  froti  neighboring  nodes  is 
utilized  to  update  the  delay  table  estimates.  In  the 
latter  case,  the  following  procedure  is  used. 

Sippose  a  decision  at  node  J  has  been  made  to  inform 
its  neighbors  ( say  N^,  N,  and  N,,  as  in  Figure  6)  of  its 
current  minimum  estimated  delays  to  reach  all  nodes 
within^  the  network.  Mode  J  forms  a  minimum  delay  vector 
Vj  -  (T(l)  ,T(2)  /  . '  where  the  aonponent 

T(K)  *  min  $j(R,lty)  and  transmits  V^.  to  its  neighbors 


table  entries  are 

T^(0,I^}  -  Q(M,tjj)  +  Dp  +  T(D) 

where  Q(M,I{{)  is  the  queue  length  of  line  matter  at 
node  M.  The  aenstant  E^  can  be  in  tern  re  ted  in  two  ways. 
First,  if  its  value  equals  the  average  time  to  transmit 
a  packet  over  an  outgoing  channel,  then  neglecting  chan¬ 
nel  propagation  delays,  D-  represents  the  minimun  average 
delay  to  reach  a  neighbor  node.  Secondly,  if  the  delay 
tables  are  updated  rapidly  in  a  lightly  loaded  net,  then 
the  delay  table  estimate  Tj(D,I^)  =  N*(J,D,IW)  -C^,  where 
N*(J,D,I^)  is  the  matter  of  lines  encountered  in  the  path 
*(J,D)  when  a  packet  leaves  node  J  on  line  1^.  Thus,  by 
varying  Dl,  we  can  control  the  degree  of  alternate  rout¬ 
ing  and  sensitivity  of  the  algorithm  to  small  variations 
in  queue  lengths.  That  is,  if  E^  is  large  ccnpared  to 
the  average  queueing  delay  in  a  node,  then  the  path  chos¬ 
en  for  a  message  will  tend  to  be  one  of  the  paths  with 
smallest  N*(S,D,0 . 

There  are  two  methods  which  can  be  employed  to  cause 
the  transmission  of  the  delay  table  update  vectors  Vj: 

(1)  Tie  periodic  updating  algorithm  forces  these  trans¬ 
missions  at  a  periodic  rate  ft,  (as  is  currently  done 
in  the  AFPA  network)  and  (2)  the  asynchronous  updating 
algorithm  allows  these  transmissions  asynchronously; 
this  transmission  can  occur  after  the  pouting  of  apack- 
«t  (via  Bq.  (2))  on  line  01*,  (i)  if  Tm  (i,0I^(i) )  has 
changed  by  more  than  a  specified  amount*  (a  threshold) 
since  the  last  update  occurred.  Thus,  the  delay  \ectors 
can  percolate  throughout  the  net  in  a  short  time  period. 
If  the  threshold  value  is  excessively  large,  updating 
oeases  and  the  asynchronous  routing  schemes  reduce  to  the 
shortest  queue  +  bias  a^porithms  (with  bias  equal  to  Dp) . 

The  choice  cf  routes  is  determined  as  follows:  If  the 
update  mechanism  is  periodic,  then  the  set  of  routes  ob¬ 
tained  vita  Eq.  (2)  is  held  fixed  until  the  tables  are 
again  updated;  in  the  asynchronous  case,  Eq.  (2)  is  used 
to  determine  the  route  of  each  packet. 

Of  all  the  stochastic  techniques,  the  distributed 
routing  algorithm  are  the  most  efficient  for  handling 
line  and  node  failures .  Cnee  a  failure  is  determined 
(see  ftef .  9  for  procedures  utilized  in  the  MPA  network) , 
the  proper  entries  in  the  node  delay  tables  can  be  forced 
to  main  excessively  large  as  long  as  the  failure  per¬ 
sists. 


VECTOR  Yf 


(Np  and  Nj) .  upon  receipt  of  a  minimum  delay  vec¬ 
tor,  a  node  (for  example  Nj)  adds  its  current  output 
line  queue  length  (line  1^  for  this  exanple)  plus  a 
constant  E^  to  all  entries  in  the  vector  Vj  and  replaces 
colum  1  (corresponding  to  Lj_)  in  its  delay  table  with 
these  new  values.  Iti thematically,  the  updated  delay 


Returning  to  Figure  3,  the  arrows  on  the  right-hand 
side  represent  (from  toil  to  head)  increasing  complexity 
and  expected  performance  of  the  algorithms.  Of  all  the 
routing  techniques  shown,  we  feel  that  the  distributed 
stochastic  routing  techniques  have  the  best  potential 
performance  to  offer  in  operational  store-and-forward 
acrputer-conmunicaticn  networks.  These  techniques  oper¬ 
ate  essentially  as  distributed  network  routing  control 
aenters  and  can  adapt  rapidly  to  link  and  node  failures 
as  well  as  to  changing  traffic  conditions. 

IV.  NEPOHC  PEFFOBMfiNCE 

In  order  to  design  optimal  aonputer-ocRimzuication  net¬ 
works  or  to  assess  their  performance,  cne  requires  quanti¬ 
tative  measures  of  network  performance.  There  are  basi¬ 
cally  two  classes  of  performance  measures.  The  first 
class  does  not  relate  in  any  simple  way  to  individual 
messages  in  the  network,  but  rather  to  the  performance  of 
particular  conponents  that  oonpose  the  network.  Examples 
of  such  performance  measures  are:  average  channel  utili¬ 
zation;  nodal  storage  utilization;  and  channel  error 
rates.  Many  of  these  perfosnance  measures  can  be  com¬ 
puted  analytically.  The  second  class  of  performance  mea¬ 
sures  relate  more  directly  to  individual  messages  and 
more  definitive  statements  about  overall  network  perfozm- 
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men  can  be  made.  An  exanple  of  such  a  performance  mea¬ 
sure  Is  the  measured  distribution  of  time  to  transmit  a 
message  through  the  net.  However,  among  the  possible 
performance  treasures,  the  average  message  delay  is  the 
only  am  that  has  yielded  to  analysis.  In  addition,  it 
also  reflects  the  following  network  phenanena  in  its 
amputation: 

•  Message  delay  due  to  formation  of  queues  within 
the  nodes 

•  Nodal  processing  delays 

•  The  decrease  in  effective  channel  capacity  due 
to  the  transmission  of  acknowledgnents  and  rout¬ 
ing  information  within  the  network 

*  Negative  acknowledgments  causing  packet  retrans¬ 
mission 

.  Adaptability  of  the  routing  algorithm  to  varying 
traffic  loads  and  channel  and  node  failures 

*  Packet  looping  caused  fay  momentary  errors  in 
estimation  of  the  required  routes  fay  the  routing 
algorithm,  and 

*  Nodal  storage  blockage 

m  earlier  works  on  gamonmiceticn  nets11  and  computer 
oonnunicatian  networks12'13,  Kleinrgck  studied  such  nets 
using  methods  from  queueing  theory1 '  which  he  showed 
provide  an  effective  method  for  the  canputatim  of  the 
average  delay  of  single  packet  messages  using  fixed 
routing  procedures.  Fultz"  has  modified  these  models  to 
more  accurately  predict  the  single  packet  message  delays. 
In  addition,  he  has  removad  sane  of  the  independence  as- 
stnpticns  discussed  in  Jtef .  11  in  order  to  handle  the 
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fixed  routing  procedure  utilizing  the  network  configura¬ 
tion  shown  in  Figure  1.  Both  the  analytic  and  simula¬ 
tion  models  reflect  an  assured  traffic  matrix  [TO]  whose 
entries  give  the  average  traffic  flow  requirements  in 
bits/aeccnd  between  souroe-des tinatian  pairs  of  nodes. 

In  Figure  7  we  have  scaled  all  entries  in  [TO]  by  a  fac¬ 


tor  re  (called  the  Effective  Data  Rate)  which  allows  us 
to  study  the  average  message  delay  as  a  function  of  net¬ 
work  loading.  Figure  8  shows  the  average  massage  delay 
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for  the  two  priority  classes  as  we  vary  the  mix  of  short 
(single-packet)  high-priority  messages  and  long  (multi¬ 
packet)  low-priority  messages,  while  maintaining  a  con¬ 
stant  average  input  data  rata  to  the  entire  net.  Ibr 
toe  fixed  routing  procedure,  we  see  that  toe  average  mes¬ 
sage  delay  is  adequately  predicted  by  toe  analytic  re¬ 
sults.  However,  when  one  assesses  the  performance  of  Sto¬ 
chastic  routing  techniques,  these,  curves  do  not  indicate 
typical  network  performance.  Kleinrock11  and  Prosser14'13 
hove  given  methods  to  analyze  randan  routing  procedures. 
Here  we  give  a  method  of  estimating  average  single-packet 
message  delay  for  the  isolated  and  distributed  stochastic 
routing  procedures. 

Ns  begin  fay  noticing  that  the  isolated  and  distributed 
algorithms  operate  as  fixed  routing  procedures  over  small 
periods  of  time.  As  time  progresses  and  toe  algorithm 
adapts,  it  utilizes  various  combinations  of  fixed  routing 
procedures.  Of  interest  is  that  fixed  routing  procedure 
which  minimizes  toe  avenge  message  delay  for  a  given  net¬ 
work  loading  factor  IE.  Figure  9  portrays  the  average  _ 
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single-packet  message  delay  as  a  function  of  FE  and 
various  fixed  routing  procedures.  The  lower  envelope 
of  all  these  delay  curves  reflects  the  minimum  average 
message  delay  utilizing  fixed  routing  algorithms.  We 
have  a  procedure  for  oonputing  this  lower  envelope8. 

The  horizontal  line  of  value  TSp  is  the  theoretical 
mininun  average  message  delay  ana  represents  a  solution 
of  the  shortest  path  problem*  far  FE  -  0.  the  shaded 
portion  of  the  figure  represents  a  region  of  operation 
which  can  only  be  penetrated  if  the  stochastic  routing 
algorithm  happens  to  take  exquisite  .advantage  of  the 
instantaneous  characteristics  of  message  flew  within  the 
network  to  produce  a  smaller  average  message  delay  than 
the  best  fixed  routing  algorithm.  To  date,  none  of  our 
(simulation  results  has  penetrated  this  region.  This 
'indicates  that  the  lower  envelope  delay  curve  is  a  good 
measure  of  attainable  performance  for  stochastic  rout¬ 
ing  algorithms. 


For  the  periodic  updating  algorithm,  there  are  two 
parameters  which  may  be  adjusted  for  performance  optimi¬ 
zation  (Dp  and  the  periodic  update  rate  Ry)  for  any 
value  of  HE.  Figure  10  shows  this  performance  as  a 
function  of  -Dp  for  various  values  of  By  with  re  ■  1. 
These  delay  curves  reflect  the  additional  message  delay 
caused  by  the  presence  of  the  routing  update  traffic 
flow  within  the  net.  For  each  routing  update  packet 
(which  contains  the  vector  Vr) ,  its  line  transmission 
time,  Ty,  utilized  in  the  simulation  program  was 
0.8Tp,  where  Tp  («  12.6  msec)  is  the  average  line 
transmission  time  for  a  single  -packet  message  (average 
packet  length  in  bits  divided  by  the  line  capacity  in 
bits  per  sec) . 


•  Dp  IN  MSEC 


PifMf*  10.  Periodic  updatint  Algorithm  (RE  ■  1) 


For  small  Dp,  the  simulation  program  shows  that 
many  loops  exist  in  the  fixed  routing  procedure  utilized 
between  delay  table  updates  and  thus  produces  a  large 
average  message  delay  as  shown  in  the  figure;  the  higher 
values  of  By  shown  permit  better  adaptation  to  the 
traffic,  even  offsetting  the  increase  in  traffic  due  to 
these  updates.  Although  not  shown  in  the  figure,  lim¬ 
ited  simulation  data  indicates  that  the  average  delay 
for  Ry  ■  20  updates/sec  is  larger  than  for  By  ■  10 
updates/sec  j  thus  R»  cannot  be  increased  indefinitely 
without  suffering  a  loss  in  performance. 


Fbr  large  Dp  (60  msec  and  greater) ,  little  evidence 
of  looping  is  fovmd.  The  average  message  delay  for 
Dp  -  200  msec  is  within  two  msec  of  the  simulation  re¬ 
sult  at  re  =  1  for  the  fixed  routing  procedure  based 
upon  the  solution  of  the  shortest  path,  zero-load  prob¬ 
lem.  This  indicates  that  the  delay  table  updating  can¬ 
not,  for  this  value  of  Dp,  adapt  to  the  fluctuations 
in  network  traffic  so  as  to  lower  the  average  message 
delay.  However,  the  algorithm  can  still  adapt  to  line 
and  node  failures  and  the  delay  and  routing  tables  would 
reflect  these  failures.  For  the  simulation  data  plotted 
in  Figure  10,  the  minirum  average  delay  occurs  at 
Dp  =  60  msec,  which  is  approximately  five  times  as  large 
as  the  average  line  transmission  time  Tp  for  a  single 
packet.  In  the  solution  of  TSP  for  this  network,  the 
longest  route  also  oontains  five  lines.  Further  invest¬ 
igation  is  required  to  determine  if  there  is  a  similar 
observable  pattern  for  other  values  of  BE  and  for  var¬ 
iations  in  the  traffic  matrix  [TM]  and  network  top¬ 
ology. 

For  the  asynchronous  updating  algorithm,  there  are 
also  two  parameters  vhich  can  be  adjusted- for  perform¬ 
ance  optimization  (Dp  and  the  threshold  values) .  Here 
we  consider  constant  thresholds  (adaptive  thresholds 
will  be  considered  in  the  future) .  The  simulated  updat  - 
ing  procedure  operates  as  follows:  A  copy  of  the  new 
ndnimun  delay  vector,  Vr,  is  retained  in  node  J  each 
time  it  is  formed  far  updating.  As  packets  are  routed 
at  node  J  via  Bj.  (2) ,  the  minimum  delay  corresponding 
to  GTfjd)  is  oonpared  to  its  corresponding  entry  T(i) 
in  the  stored  vector  Vj  as  shown  below. 

|Tj(i)  -9J(i,<Xk|(i))|  -ATj(i)  (5) 

If  ATj(i)  2  threshold,  then  the  update  procedure  is  in¬ 
voked  as  shown  in  Figure  6.  otherwise,  no  update  occurs. 
The  motivation,  of  course,  for  utilizing  thresholds  is 
too  sense  changes  in  the  traffic  distribution  (delay)  and 
only  update  when  these  changes  are  pertinent  as  opposed 
too  the  periodic  updating  algorithm  which  forces  updates 
even  whan  the  delay  tables  remain  static.  Figure  11 
■hews  the  algorithm  performance  as  a  functicn  of  Dp 
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far  various  threshold  values  with  FE  *  1.  The  curves 
labeled  (N)  indicate  nornai  operation  of  the  algorithm 
(Ty  »  G.&Tp) ,  while  the  curve  labeled  (Z)  corresponds 
to  Ty  -  0.  Thus,  the  difference  between  the  two  SO 
msec  threshold  curves  represents  the  Increase  in  average 
message  delay  due  to  the  presence  of  the  update  traffic 
within  the  net. 

the  asynchronous  update  algorithm  does  not  exhibit 
the  distinct  minimum  message  delay  as  a  function  of  Dp 
as  found  for  the  periodic  update  algorithm.  Also,  no 
correlation  was  found  between  the  number  of  updates  and 
message  delay  for  a  fixed  threshold  value,  even  though 
the  nunber  of  updates  increased  as  Cb  increased  (ex¬ 
cept  for  the  dip  in  the  50  msec  threshold  curves  at 
t^>  *  60  msec) .  For  a  fixed  2  60  msec,  there  is  a 
correlation  between  average  message  delay  and  threshold, 
the  mininun  being  at  approximately  the  50  msec  threshold, 
which  lies  between  the  34  and  100  msec  thresholds. 

Perhaps  the  most  interesting  delay  curve  shown  in 
Figure  11  is  that  for  a  threshold  of  100  irBec.  Fbr 
s  34  msec,  no  updates  occurred  during  the  simulation; 
us  the  algorithm  operation  reduced  to  the  shortest 
queue  +  bias  class.  However,  line  and  node  failures 
would  cause  the  algorithm  to  update.  It  is  quite  pos¬ 
sible  that  the  threshold  test  (Eq.  (5))  ouuld  be  elimi¬ 
nated  and  updating  forced  cnly  when  a  line  or  node  fail¬ 
ure  is  recognized.  This  requires  further  investigation. 
Fbr  Cfc,  s  Tp  msec,  the  algorithm  becomes  highly  unstable 
and  many  loops  appear  in  the  routing.  This  accounts  fbr 
the  large  increase  in  delay,  as  the  figure  indicates. 

Finally,  Figure  12  shows  tbs  best  simulated  perform¬ 
ance  of  three  routing  algorithms  (periodic  updating, 


asynchronous  updating  and  shortest  queuie  +  bias)  as  a 
function  of  the  netarerk  traffic  loading  factor  FE.  The 
lc**r  envelope  represents  the  achievable  average  message 
delay  for  the  best  fixed  routing  scheme  (although  it 
would  very  with  re),  but  has  not  as  yet  been  simulated. 
Hcwever,  tht  results  shown  in  Figure  7  indicate  that 
simulation  s'rauld  agree  with  this  theoretical  cuurve. 

The  periodic  updating  algorithm  is  shown  to  be  infer¬ 
ior  in  performance  to  the  other  algorithms  simula*Td. 


Apparently,  utilizing  a  fixed  routing  procedure  between 
updates  causes  increased  "xugestian  within  the  network 
and  thus  increases  message  delay. 

Fbr  a  moderate  threshold  value  (100  nsec) ,  the  asyn¬ 
chronous  updating  algorithm  achieves  the  sane  perform- 
ance  as  the  shortest  queue  +  bias  algorithm  (because  no 
deity  table  updates  were  initiated  dinring  the  simula¬ 
tion)  .  The  60  msec  threshold  value  produces  a  very 
interesting  result.  As  FE  approaches  1.25,  the  asyn¬ 
chronous  updating  algorithm  performs  better  than  the 
shortest  queue  +  bias  algorithm.  This  shows  that  the 
algorithm  is  utilizing  the  information  contained  in  the 
minimum  delay  rectors  Vj  to  adapt  to  the  fluctuations 
in  network  traffic  flow.  Further,  it  indicates  that 
the  presence  of  the  delay  table  updating  traffic  within 
the  net  does  not  necessarily  cause  an  increase  ii\  the 
average  message  delay. 

Before  a  mere  detailed  corpariscn  can  be  ireude  among 
the  algorithms,  further  understanding  of  the  relation¬ 
ship  between  Dp  and  Ry  or  the  threshold  value  rrust 
be  gained.  In  addition,  line  and  node  failures  must  be 
simulated  in  order  to  determine  how  rapidly  the  algo¬ 
rithms  adapt  and  what  average  message  delay  they  produce. 

v.  ocncueions 

We  have  presented  a  meaningful  overview  of  routing 
techniques  available  for  axiputer-ocRirunication  networks 
and  have  developed  the  structure  of  routing  algorithms 
which  appear  to  be  the  moat  promising  fbr  operational 
networks.  The  main  thrust  of  our  research  has  been  to 
develop  models  of  network  performance  and  rouiting  algo¬ 
rithms  and  compare  their  performance  via  ccnputer  simu¬ 
lation.  Moreover,  preliminary  measurement  data  (time 
delay  measurements,  degree  of  alternate  routing,  etc.) , 
collected  by  Cble4  on  the  AHPA  network,  indicates  gen¬ 
eral  agreement  with  our  simulation  results.  We  are  now 
in  a  position  to  ccnpare  our  analytic  and  simulation 
models  with  real  network  performance  data. 

We  have  demonstrated  that  fixed  routing  procedures 
perform  most  effectively  from  among  our  many  caparisons; 
however,  such  procedures  cannot  adapt  to  variations  in 
network  traffic  and  topology.  The  adaptability  of  our 
distributed  stochastic  algorithms  provides  efficient 
performance  under  such  variations  and  appears  .as  strong 
candidates  for  use  in  stare -and-fbrward  oerputer -can- 
muni  cation  nets. 
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ABSTRACT 


A  theoretical  study  is  given  for  store- and- forward 
oonmunication  networks  in  which  the  nodes  have  finite 
storage  capacity  for  messages.  A  node  is  "blocked” 
when  its  storage  is  filled,  otherwise  it  is  "free."  A 
two-state  Markov  model  is  proposed  for  each  node,  and 
the  number  of  blocked  nodes  in  the  network  is  shown  also 
to  have  a  two-state  Markov  process  representation. 

Digital  canputer  simulations  substantiate  the  theoreti¬ 
cal  results. 

INTRODUCTION 

Consider  a  storecard-forward  ocnmunication  network 
(e.g. ,  see  Refs.  1-5)  consisting  of  nodes  having  finite 
storage  space  for  messages.  During  periods  of  high 
traffic  intensity  this  storage  can  be  expected  to  fill  i 
from  time  to  time.  Ir.  this  condition  the  node  must  re¬ 
fuse  incoming  messages  (which  might  be  accomplished  by 
sending  negative  acknowledgments)  and  we  then  say  that 
the  node  is  "blocked." 

As  soon  as  one  message  is  transmitted  by  a  blocked 
node,  it  becomes  a  "free"  node.  It  remains  in  this 
state  as  long  a3  there  is  at  least  one  empty  space  in 
storage  that  could  be  used  by  an  arriving  message. 

When  the  storage  fills  again,  the  node  re  enters  the 
blocked  state. 

THE  ECOEL 

Figure  1  shews  a  simplified  model  of  such  a  noda  in 
the  terminology  of  the  AWA  network1-5.  The  Interface 
Message  Processor  (IMP) ,  when  free,  accepts  messages 
into  its  main  storage  fron  two  sources:  (1)  other 
IMPs  like  itself,  and  (2)  a  HOST  which  generates  and 
receives  messages  (as  a  source  and  terminal)  and  ccm- 
nunicates  with  the  rest  of  the  network  by  means  of  the 
IMP.  Message  bits  are  sent  in  parallel  to  the  message 
buffer  serving  the  appropriate  output  line,  as  deter¬ 
mined  by  the  final  destination  of  the  message,  and  are 
then  transmitted  serially  to  that  neighbor.  Any  of 
these  output  lines  can  bccone  blocked,  thus  preventing 
their  use. 

In  this  paper  we  study  nodal  blocking  caused  by  the 
finite  storage  roan  for  messages  in  the  D*>  aid  the 
o/erutilization  of  the  systan.  By  overutilization,  we 
mean  that  when  the  node  is  aaoepting  messages,  its  aver¬ 
age  arrival  rate  equals  or  exceeds  its  average  service 
rate  (which  is  the  total  output  channel  capacity  divided 
by  the  average  message  length) .  Elementary  queueing 
theory6  shows  that  if  (1)  the  system  is  underutilized, 
and  (2)  there  is  storage  space  for  approximately  twenty 
messages  or  more,  then  under  fairly  general  conditions 
there  will  be  essentially  r»  blocking. 

Nodal  blocking  is  a  transient  effect  which  should  oc¬ 
cur  only  at  peak  hours  during  the  day  in  a  well-designed 
systan,  but  once  started  it  could  propagate  in  both 
space  and  time.  The  analysis  of  this  propagation  is 
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difficult  for  at  least  three  reasons.  First,  it  involves 
networks  rl  queues,  for  which  only  stationary  results  at 
best  can  generally  be  obtained.  Second,  the  pertinent 
stochastic  processes  are  dependent,  for  if  a  node  becomes 
blocked,  it  cannot  accept  messages  fren  its  neighbors 
and  their  storage  will  tend  to  fill  at  a  faster  rate. 
Finally,  it  is  a  transient  queueing  problem  and  even  the 
simplest  of  these  is  very  difficult  to  solve.  (For  exam¬ 
ple,  the  queueing  system  with  Markovian  arrivals,  a 
single  exponential  server,  and  unlimited  waiting  room  has 
modified  Bessel  functions  in  its  time  dependent  solution6.) 

Since  we  cannot  solve  the  problem  exactly,  our  goal  is 
to  make  good  approximations  that  allow  us  to  analyte  the 
system  and  characterize  its  blocking  behavior  in  sane  way. 
lb  this  end  we  make  the  following  assumptions: 

1.  The  HOST  cannot  bccone  blocked  (it  is  an  infinite 
sink) 

2. a.  Input  traffic  fratt  the  (OST  is  Poisson 

b.  Traffic  on  all  lines  has  the  sane  average  rate 
so  that  total  average  traffic  into  each  node 
is  o  messages/sec. 

3.  a .  Message  lengths  are  exponentially  distributed 
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and,  by  the  assumption  of  exponential  message  lengths, 
the  departure  process  from  cadi  output  line  constitutes 
a  Poisson  stream  when  that  neighbor  is  not  blocked. 


b.  service  (transmission)  time  on  any  lin  is 
therefore  exponentially  distributed  such 
that  for  a  node  with  k  blocked  neighbors, 
the  rate  at  which  messages  exit  from  that 
(k) 

node  is  u  messages/sec. 
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k  -  0,1,  ...,  4  (3) 


4.  Probability  of  an  errpty  queue  in  the  DC*  is 
approximately  zero  (since  the  systsn  is  assumed 
to  be  overutilized) 

ANALYSIS  AND  RESULTS 

Under  these  assumptions  we  arrive  at  a  simplified 
blocking  model  for  a  node  in  the  network,  and  its  de¬ 
scription  as  a  two-state  Markov  process  is  given  in 
Figure  2.  If  the  node  is  blocked,  i.e. ,  in  state  b. 


where  y  is  a  given  system  parameter  and  represents 
the  maximum  message  departure  rate  frem  a  node.  This 
set  of  lumbers  is  merely  an  illustration;  any  coiibina- 
tlon  can  be  treated  by  this  model.  These  results  show 
that  we  can  approximate  the  time  spent  in  the  blocked 
state  as  being  exponentially  distributed  with  parameter 

l»00 . 

The  time  spent  in  the  free  state,  however,  is  distri¬ 
buted  as  the  busy  period  in  a  queueing  system  with  finite 
queueing  roan  far  customers,  as  ua  now  show.  Consider 
the  state  transition  diagram  or  Markov  chain  model  for 
our  single  node  qu  jeing  system  shown  in  Figure  3a. 
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it  becomes  free  in  the  next  instant  of  time  At  with 
probability  |i'k'At  where  k  is  the  nurber  of  blocked 
neighbors  it  is  experiencing  at  that  time.  Similarly, 
if  the  node  is  free,  i.e. ,  in  state  f ,  it  beocmes 
blocked  in  the  next  instant  of  time  At  with  probabil¬ 
ity  A'*' At  where  k  is  again  the  nunber  of  blocked 
neighbors. 


b)  DUAL  QUEUE  STATE  TRANSITIONS 


Below  we  show  the  appropriateness  of  this  model. 

First,  we  require  the  Laplace  transform  of  the  inter- 
departure  time  probability  density  =  D(s) .  For  ary 
node  let  p  =  Pr(non-anpty  node]  and  let  the  Laplace 
transform  if  the  probability  density  of  the  interarrival 
time  prone**  be  A(s) .  Because  we  have  assured  that  the 
service  time  is  exponential  with  parameter  y  (W ,  we 
know  that  the  Laplace  transform  of  the  departure  process, 
conditioned  on  a  non-empty  system  is  y  W/fc  +  y'*c* ) . 
Therefore, 

ou°°  u« 

DCs)  -  BL  n  y  +  (1  -  P)A(S)  — SL-7CT-  (1) 

s  +  ylK'  s  +  yw 

By  assumption  (4)  we  have  p  *  1 

H« 

D{s)  *  -**  -fry  (2) 

S  +  y  w 

which  says  that  the  departure  process  is  a  Poisson 
stream. 
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The  numbers  inside  the  circles  represent  the  nunber  of 
customers  in  the  node.  Customers  arrive  in  a  Poisson 
fashion  with  parameter  a,  and  depart  after  receiving 
service  (exponentially  distributed  with  an  average  of 
1/v  seconds) .  A  busy  period  begins  when  a  customer  ar¬ 
rives  to  find  an  empty  system  (at  which  time  he  imredi- 
ately  enters  the  service  facility) .  Customers  arriving 
during  his  service  time  form  a  queue  behind  him.  With 
each  arrival  the  system  moves  to  the  right  along  the 
state  transition  diagram,  because  the  nunber  in  the  sys¬ 
tem  is  increased  by  one,  and  with  each  service  ocmple- 
tion,  i.e.,  departure,  it  moves  to  the  left.  The  busy 
period  ends  the  first  time  the  system  goes  erpty  after 
initiation  of  the  busy  period. 


Ns  have  assured  that  the  traffic  on  all  lines  has 
the  same  average  rate.  If,  for  example,  every  node  has 
exactly  four  neighbors  and  one  HOST,  then  there  are  five 
output  lines  from  each  node.  All  of  these  lines  axe 
equivalent  (except  that  the  HOST  cannot  became  blocked) 


We  now  consider  a  dual  queue  in  which  the  roles  of 
service  and  arrival  are  reversed,  and  the  numbers  inside 
the  circles  now  represent  the  number  of  enpty  places  in 
storage  that  could  be  used  by  arriving  messages  (see 
Figure  3b) .  The  free  period  of  the  IMP  begins  with  the 


departure  of  a  message  iron  a  previously  filled  system, 
i.e. ,  no  empty  places  for  arriving  messages.  With  the 
transmission  (departure)  the  system  moves  from  state  0 
to  state  1.  It  oontinues  to  move  to  the  right  with  each 
tranani.ision  and  to  the  left  with  each  arrival.  The 
free  period  ends  the  first  time  the  systsn  returns  to 
the  0  state,  the  correspondence  between  the  primal  and 
dual  queues  is  perfect;  thus  any  results  obtained  for 
the  busy  period  in  the  primal  system  are  applicable  to 
the  dual  queue  free  period  in  the  IMP  simply  by  substi¬ 
tuting  iiW  for  o  and  a  for  y. 

The  busy  period  for  a  finite  qreueing  roan  systsn  is 
difficult  to  obtain,  but  the  result  for  unlimited  queue¬ 
ing  roan  is  well  known.  The  probability  density  of  the 
length  t  of  the  busy  period  in  such  a  systsn  is 

p(t)  -  -±-  e"(ff4li,t  X.  (4) 

t/p  1 


Let  pk(t)  ■  P(k  neighbors  blocked  at  time  tj  (8) 

and  let  p(t)  °  P  [node  blocked  at  time  t]  (9) 

Then,  fran  elementary  considerations,  we  have  (correct 
to  within  0  (At) ) 

P(t*At)  -  (l-p(t)  £  P^t )X°°At 
k-0 

where  fran  Bq.  (3)  y^  ■  y^  - 
and  fran  Eq.  (6)  X^  ■  a  -  y®^ 
for  a  >  u^. 

We  also  note  that 

P  +  y«  - 


+  p(t)  (l-^Pk(t)u(k)At) 
(k/5)y(0> 

=  c  -  y(0)  +  (k/5)  p (0) 


(10) 


where  p,  the  utilization  factor  =  (c/y)  <  1  and 
I.  (x)  is  the  modified  Bessel  function  of  the  first 
kind,  of  order  one'.  If  the  size  of  the  queueing  roan 
is  greater  than  20,  the  solution  for  unlimited  queueing 
roan  is  a  good  approximate  solution  to  the  limited  queue¬ 
ing  roam  problem.  (This  follows  since  we  have  assumed 
P  [empty  IMP]  *  0.  But  the  P  (empty  IMP]  corresponds  to 
the  probability  of  being  in  state  N (i.e.,  all  N  spaces 
are  snpty)  in  Figure  3b,  and  thus  an  increase  in  N  will 
not  seriously  affect  our  results.)  since  we  have  as¬ 
sumed  overutilization,  we  have  (y  (0)/o)  <  1,  and  we 
are  justified  in  substituting  this  (or  y^'/cr)  for  p. 
Thus  we  get  the  following  for  the  probability  density  of 
the  length  t  of  the  time  spent  in  the  free  state: 


As  the  ratio  y ^  /o  approaches  0,  i.e.,  as  the  system 
becomes  more  overutilized,  this  density  approaches  that 
of  the  exponential  distribution.  TO  arrive  at  a  more 
tractable  model,  we  approximate  the  free  period  distri¬ 
bution  by  the  exponential  distribution  having  the  sane 
mean  value.  The  mean  value  of  the  busy  period  in  the 
original  system  is  easy  to  obtain,  and  is  given  by 
VWl-p)  .  Therefore,  as  an  approximation  to  the  free 
period  in  the  IMP,  we  take  an  oponential  distribution 
with  mean  value  l/(o~U  *')»  i.e.,  with  a  parameter. 


Thus,  “  (1  -  P(«)  £pk(t)X(k> 


-p(t)£pk(t)y(k> 


Letting  At  approach  0,  vre  have 


-p(t)£pk(t)(X(k)  +y(k>)  +  £Pk(fc)X(k> 
ot  Jc-0  k-0 

-  -0P(t)£Pk(t)  +  JjPk(t)  (a  -  y(0)  +|  y(0)) 

-  -op(t)  +  o  -  y<0)  +  H,!?L£kPk(t)  (11) 

5  k-0 


This  can  be  simplified  by  noting  that 


E [number  of  blocked  neighbors  at  time  t]  -  T'H>k(t)  (12) 

krt 


For  the  marginal  case,  a  -  y  ^ ,  elementary  queueing 
theory®  shows  that  we  must  take 


there  E  denotes  expectation. 
Define  the  indicator  function 


,  (0)  a 
A  N 


for  o 


.(0) 


(7) 


fn(t) 


where  N  is  the  size  of  the  storage  capacity  in  the 

IMP.  Now  let 


1  if  node  n  is  blocked  at  time  t 
0  otherwise 


Our  model  for  the  blocking  IMP  is  thus  a  two-state 
Markov  process  or,  in  the  language  of  renewal  theory, 
an  alternating  Poisson  renewal  process8.  One  way  to 
describe  the  dynamics  of  a  network  of  such  nodes  is  to 
examine  the  probability  that  any  given  node  is  blocked 
at  sane  time  t.  Oons'  der  a  node  with  four  neighbors 
numbered  1  to  4: 


4 


pn(t)  -  P[node  n  is  blocked  at  time  tj 
then  E[£n(t)]  -  pn(t) 

Further,  fran  Bq.  (12) ,  we  have  that 


E(£fn(t)) 

neN 


-  £E(fe(t)) 
neN 


(13) 


(14) 


9$ 


wht  *  N  is  the  set  of  neighbors  for  this  node  (of  which 
thei  ore  four).  From  Eqs.  (13)  and  (14)  we  get 


(0) 


PB(t  +  At)  ■  F^(t)  (1  -  At)  +  (1 


-  Ffe(t))  (o  -  p(0)A t 


JjkP^t)  -  p^t)  +  p2(t)  i-  p3(t)  +  p4(t)  (15) 


.  «%<« 
*  *  ~3t~ 


-r^(t)(o- Jp(0))  +  (o  -  y(0)) 


Finally,  fran  Eqs.  (11)  and  (15)  we  have  the  result 


or 


«  -op(t)  +  a-y<°> 
u(0) 

+  5 -~(P^(t)  +  Pj(t)  +  Pj(t)  +  (t) ) 


06) 


PgtO)  - 


.  -  »<“ 


o-V°> 


o-/»» 

* 


(18) 


This  relation  can  also  be  derived  fran  epidemiology  by 
considering  nodal  blocking  as  a  deterministic  epidemic 
without  migration  and  with  but  two  kinds  of  individuals, 
infected  and  susceptible’ . 

Adjacent  nodes  have  nearly  equal  probabilities  of 
being  blocked.  Consider  the  case  when  all  of  these 
probabilities  are  exactly  equal  (as  an  approximation) . 
Then  from  Eq.  (16) 

■  -op(t)  +  o  -  v*0)  +  5  u^p(t) 


This  is  the  same  as  Bq.  (17)  which  was  obtained  for  the 
probability  that  a  node  is  blocked  at  time  tl  In  a 
large  homogeneous  systan  the  fraction  of  blocked  nodes 
may  be  closely  approximated  by  the  probability  that  any 
one  of  than  is  blocked.  Therefore,  the  fraction  of 
blocked  nodes  at  time  t  ~in  a  large  uniformly  connected 
(i.e. ,  two- -dimensional  lattice)  network  is  approximately 
equal  to  the  probability  that  the  tap-state  Markov  process 
shown  in  Figure  4  is  in  the  blocked  state  at  time  t. 

Thus  we  may  take  this  two-state  Markov  process  as  a  nodel 
for  the  network. 


-(O  -y  P<0,jp(t) 


+  a 


-p(°) 


WvLch  has  the  solution 


So  far  we  have  presented  only  aggregate  results.  To 
obtain  the  probability  that  any  given  node  in  the  network 
is  blocked  at  time  t  we  must  consider  a  systan  of  equa¬ 
tions  of  the  farm 


P(t) 


p(0)  - 


-M(°> 


-(a-  $,«)t 


a  - 


-w(0) 

~T*75) 


(17) 


^(t) 

. <k 


-  opjft)  +  a  -  p(0) 

u(°) 

+  *3—  (Pj(t)  +  P^(t)  +  p£(t)  +  f^(t)) 


Now  consider  the  alternating  Poisson  renewal  process 
shown  in  Figure  4.  There  are  two  states,  called  (B)  and 


for  each  node  i  in  the  network  with  neighbors  j,  k,  l, 
and  m.  These  equations  are  obviously  of  the  form 


#(t)  -  AP ft)  +  C 


(19) 


T 


F  4.  NftWuffc  Model 


If  there  are  n  noc’.js  in  the  net,  then  P(t)  is  the 
n  x  1  matrix  whose  i  ■"  oenponent  is  the  probability  that 
node  i  is  blocked  at  time  t.  A  is  an  n  x  n  constant 
matrix  and  C  is  an  n  x  1  constant  matrix.  The  solu¬ 
tion  is  well  known: 


P(t)  -  ef^PfO)  +  A~1(eAt  -  l)C  (20) 


For  a  snail  net  this  solution  poses  no  difficulty,  but 
for  a  large  one  the  required  matrix  confutations  rapidly 
get  out  of  hand.  There  are  soma  special  cases  which  are 
solvable,  however,  and  we  obtain  the  solution  far  one  of 
these  below. 


free  (F) .  If  the  systan  is  in  the  blocked  state  at 
time  t,  it  goes  free  in  the  next  instant  of  time  At 
with  probability  (u'°V5)  At.  In  similar  fashion,  the 
probability  that  it  leaves  the  free  state  and  re  enters 
the  blocked  state  is  (a  -  u'°>)At.  Therefore,  the 
probability  that  it  is  in  the  blocked  state  at  time 
t  +  At  is 


Oonsider  a  network  consisting  of  1024  nodes  arranged 
in  a  32  x  32  grid.  Bar  this  systan  the  matrix  A  is  1024 
x  1024  and  takes  the  following  form: 


D  A 
ADA 


O 


A  *= 


A  D  A 


n 


(21) 


ADA 


(22) 


where  0  ■ 


a  b 
b  a  b 
b  a  b 

O 


0 


b  a  b 
b  a 


n  x  n 


and 


A  -  bl_ 


.(0 


where  a  =  -o,  b  -  -*  and  In  is  the 

n  x  n  identity  matrix 


(23) 


(25) 


This  observation  holds  for  a  square  grid  with  an>  nuttier 
of  nodes  n  on  a  side.  (See  the  Appendix  for  the  com¬ 
plete  solution  for  P(t)  far  arbitrary  n.)  'fats  case 
of  n  “  1024  was  simulated  and  is  described  in  the  follow¬ 
ing  section. 

SIMULATION  FBSULTS 

Simulation  of  a  network  of  1024  nodes  employing  the 
Markovian  inter-event  time  assumption  substantiates  the 
approximations  described  in  the  theoretical  results 
above.  IWo  different  simulation  programs  have  been  run 
on  the  IdA  XDS  Sigma-7  computer.*  Hie  first  was  for  a 
network  arranged  in  a  square  grid  32  x  32.  Each  node  Is 
connected  to  its  four  nearest  neighbors  (a  lattice)  ex¬ 
cept  in  the  case  of  nodes  along  the  border  which  have 
only  three  nearest  neighbors  (or  two  nearest  neighbors 
in  the  case  of  the  four  corner  nodes) .  When  a  node 
changes  state,  new  event  times  are  chosen  far  it  and  far 
all  of  its  nearest  neighbors  based  on  the  new  number  of 
blocked  neighbors.  Hie  memory  less  property  of  the  expo¬ 
nential  distribution  simplifies  the  calculations. 

Hie  second  program  simulated  a  randomly  connected 
graph  In  which  each  node  was  given  exactly  four  neigh¬ 
bors. 

Comparison  of  the  two-state  Markov  process  model  and 
the  simulation  results  for  the  lattice  and  the  random 
graph  are  shown  in  Figure  5  for  one  set  of  parameters 
a  and  p'0'  starting  from  a  net  that  is  acnpletely 
blocked.  Figure  6  shows  the  results  when  the  network 
begins  with  all  of  its  nodes  in  the  free  state.  In 
Figure  7  results  are  catpared  for  the  model  and  the  two- 
dimensional  integer  lattice  in  which  each  node  is  as¬ 
sured  to  have  eight  neighbors.  This  was  aoccnplished 
by  extending  the  nearest  neighbor  definition  to  include 
nodes  vhich  are  diagonally  adjacent.  Hie  results  in 
Bq.  (18)  are  extended  in  the  obvious  way.  Figure  8  can- 
par  es  simulation  results  on  the  lattice  of  degree  four, 
vhen  a  free  node  with  k  blocked  neighbors  is  ccnsider- 
ered  k -fourths  blocked,  to  the  predicted  trajectory  based 
on  a  non-linoar  "partial  blocking"  model.  The  agreement 
with  the  simulations  is  generally  good,  and  the  model  is 
sufficiently  general  to  treat  a  variety  of  cases. 

CONCLUSIONS 

Tvo  new  models  that  may  have  application  to  store- 
and -forward  cormunication  networks  are  presented  in  this 
paper.  The  probabilistic  model  for  nodal  blocking  due 
to  finite  storage  space  is  shown  in  Figure  2.  Tte  seoorri 
model,  and  the  main  result  of  this  work,  is  that  the 
fraction  of  blocked  nodes  in  a  network  of  such  nodes  has 


•During  simulation  the  net  activity  was  displayed  on 
a  Digital  Equipment  Corporation  340  Precision  Display  CRT. 
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Bq.  (18) ) .  Figures  5-8  verify  that  the  netvnrk  model 
ocRf tores  hell  with  results  obtained  fran  the  simulation 
of  a  network  of  two-state  Mirkovian  nodes  in  which  the 
time  spent  in  either  state  is  a  function  only  of  the 
state  and  the  nuitoer  of  blocked  neighbors.  Finally,  the 
model  is  sufficiently  general  to  treat  a  variety  of  net¬ 
work  configurations  and  parameters. 
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It  is  easy  to  verify  that  the  normalized  solutions  are 


sin 


and 


where 


kvrr 
n  +  1 

D 

*?.l*  .§eTV^1,» J 
“Ijj  '  £  'V1  \l  *vj 


Similarly,  it  is  easy  to  show  that  the  transformation 
R*AR  (where  R*  is  the  transpose  of  R)  where 


R 


*•*  **• 

x12In  •**  •••  ^>2^» 

**•  *•* 


with  as  given 


Me  must  first  find  the  eigenvalues  yy  of  D  which 
are  the  solutions  of  |D  -  yl|  «  0.  Let  a  stand  for 
a  -  y  in  D*.  we  wish  to  find  the  zeros  of  the  determi¬ 
nant  of  D.  Expanding  by  the  elements  of  the  top  row, 
we  note  the  following  recurrence  relation  for  the  dater- 
minant  Aj,  of  the  n  x  n  matrix  D: 

^  “  aAn-1  -  ^n-2 

with  initial  conditions  Aj  »  a,  Ag  *  1,  A_^  *  0.  Fol¬ 
lowing  Grenander  and  Szego^we  substitute  a  ■  2b  cos  8, 
assume  a  solution  of  the  farm  *  p°,  and  solve  the 
resulting  quadratic  in  p.  After  satisfying  the  initial 
conditions  the  result  is  sinply 

a  «  vjn  sin(n  +  1)8 

^  3n~e 

which  vanishes  for  8  *  vrr/ntl  v  ■  1,2,  ..,  n 

Therefore,  the  eigenvalues  of  D  are 

a  -  2b  cos  JJF  v  ■  1,2,  ..,  n 

n  +1 

which  are  all  distinct.  The  eigenvectors  are  the  solu¬ 
tions  C/f 


above  reduces  A  to  the  quasi-diagonal  form 

M‘  O 

« 2  ~ 

...  f 

O  5. 


where 


Mv-  D  -  2b  oos 


Since  My  is  equal  to  D  with  a  change  of  the  diagonal 
element,  we  have  that  the  (k,l)  element  of  the  (i,  j) 
block  of  e*  is 

*i,  j)k,l  ‘  £"*<**>  cos£  -2b  cos^X^ 


and 


S(-2>  =*5T  -a  “*iSrVlPl 


where 
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In  our  system  a  *  -a  and  b  ■  p^/5  so  the  tine 
constants,  i.e. ,  the  arguments  in  each  of  the  exponen¬ 
tials  appearing  in  the  solution  for  eAt  are  of  the 
form 


which  takes  on  its  smallest  absolute  value  for 

v,  m  v.  m  n.  thus  the  motion  of  the  system  is  bounded 

by  J 

exp  -  (o  -  i  p(0)  oos  —Ljt 
5  n  t  1 


The  nuiber  n  is  the  square  root  of  the  nimtoer  of  nodes 
in  the  square  lattice.  This  result  shows  that  as  n  *  • 
the  system  attains  its  steady  state  at  a  rate 

exp  -  (o  -  ^  u(0))t 

which  agrees  with  simulation  results  for  n  *  3?. 
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In  many  computer  communication  systems,  random  length  messages  arc  partitioned  into  fixed 
oi/.e  blocks  for  case  in  data  handling  and  memory  management.  Wien  error  detection  and  retrans¬ 
mission  arc  used  in  the  error  control  procedures,  there  is  at  least  one  acknowledgment  delay  asso¬ 
ciated  with  each  transmitted  message  block.  Thus,  from  an  acknowledgment  point  of  view,  it  is 
dcslx’abl.c  to  select  the  larger  block  size  so  as  to  yield  the  fewer  acknowledgments  per  message.  On 
the  other  hand,  the  larger  message  block  has  a  higher  error  probability  and  so  may  result  in  more 
retransmissions  and  more  acknowledgment  delays  than  the  shorter  message  block  size.  Further, 
due  to  the  random  length  of  messages,  the  last  partitioned  block  usually  cannot  be  entirely  filled  with 
messages  and  is  filled  with  dummy  information.  The  large  block  size  has  a  larger  amount  of  such 
wasted  channel  bandwidth;  hence,  it  is  desirable  to  partition  the  message  into  smaller  block  size. 
Thus  there  is. a  trade-off  among  acknowledgment  delay,  message  error  probability,  and  the  wasted 
channel  bandwidth  due  to  the  last  unfilled  partitioned  block.  A  mathematical  model  is  developed  in 
tills  paper  to  determine  the  optimal  message  block  size  that  maximizes  channel  efficiency.  Using 
tho  model,  the  relationships  among  acknowledgment  time,  channel  transmission  rate,  channel  error 
characteristics  (random  error  or  burst  error),  average  message  length,  and  optimal  block  size  arts 
obtained  and  presented  in  graphs.  The  model  and  the  graphs  should  be  useful  as  a  guide  in  the 
selection  of  tho  optimal  fixed  message  block  size  for  computer-communication  systems. 


1.  INTRODUCTION 

To  increase  utilization  of  computer  capa¬ 
bility  and  to  share  computer  resources,  re  - 
motcly  located  computers  and/or  terminals  may 
be  connected  with  communication  links.  Such 
integrated  computer  communication  systems 
allow  many  users  to  economically  share  data 
bases  and  computer  software  systems.  These 
shared  computing  facilities  can  greatly  increase 
our  computing  capacity.  In  the  design  and  plan¬ 
ning  of  such  systems,  communication  problems 
between  computers  and  terminals  greatly  in¬ 
fluence  system  performance  (e.  g. ,  inquiry- 
response  delay)  and  overall  system  costs, 
lienee,  computer  communications  become  an 
integral  part  of  the  overall  system  design  con¬ 
sideration.  For  example.  Asynchronous  Time 
Division  Multiplexing  [1)  has  been  proposed  for 
dal  a  communication  to  increase  channel  utiliza¬ 
tion  and  reduce  communication  costa.  In  this  ' 
paper  we  address  the  problem  of  determining 
the  optimal  fixed  message  block  size  to  improve 
.efficiency  in  data  communication  systems. 


!  Kirlin  [4)  have  studied  the  optimal  message  block 
size  for  the  error  detection  and  retransmission 
system  that  maximizes  transmission  efficiency. 
In  this  paper,  we  consider  an  additional  impor¬ 
tant  parameter  —  the  average  message  (file) 
j  length  —  in  determining  the  optimal  message, 
block  size,  which  significantly  effects  the 
selection  of  the  optimal  message  block  size. 

For  economic  and  reliability  reasons  [5,  C), 
the  error  detection  and  retransmission  scheme 
•  is  used  by  many  data  communication  systems. 
Using  this  error  eontrol  technique,  encoding 
and  decoding  circuits  usually  arc  required  to 
'  operate  on  the  message  information  to  process 
redundant  data.  The  receiving  end  checks  the 
received  message  (together  with  the  redundant 
data)  and  then  generates  an  acknowledgment 
1  signal  for  the  sender  to  indicate  whether  the 
message  is  correctly  received,  If  the  message 
is  eorreetly  received  (positive  acknowledg¬ 
ment),  then  the  sender  is  permitted  to  send  a 
new  message.  If  the  message  is  incorrectly 
received  (negative  .acknowledgment),  then  the 
sender  retransmits  the  same  message,  until  a 


Knccra  [2],  Balkovic  and  Mucnch  (3),  and 


positive  acknowledgment  of  that  message  is 
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(11-1)  Gen  -  10,  Project  14  and  U.  S.  Offico  of  Naval  Research,  Research  Program  Office,  Contract 
No.  N00014-G9-A-0200-4027,  NR  048-120.  '  - 
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ANALYSIS 


received  by  the  sender. 

The  message  outputs  from  a  computer  arc 
usually  In  strings  of  characters  or  bursts.  The 
message  length  may  be  different  from  one  to 
another  and  can  best  be  a  scribed  by  a  proba¬ 
bility  distribution.  For  ease  in  data  handling 
and  memory  management,  the  random  message 
length  is  usually  partitioned  into  several  fixed 
size  blocks.  Dub  to  the  random  length  of  the 
message,  the  last  partitioned  block  usually  can¬ 
not  be  entirely  filled  by  the  message  and  is 
filled  with  dummy  information.  From  the  ack¬ 
nowledgment  point  of  view,  it  is  desirable  to 
select  the  largest  possible  block  size.  Since 
each  message  block  requires  at  least  one  ack¬ 
nowledgment  signal,  the  fewer  the  number  of 
blocks  needed  for  a  message,  the  less  the  chan¬ 
nel  capacity  required  for  acknowledgments.  On 
the  other  hand,  since  the  larger  message  block  * 
lias  a  higher  probability  of  error  and  also  has  a 
liighcr  clianncl  wastage  due  to  the  last  unfilled 
partitioned  block.  It  is  desirable  to  select  tho 
smallest  possible  block  size.  Thus  there  is  a 
tradeoff  in  selecting  the  optimal  block  size.  The 
basic  problem  is:  suppose  the  average  messago 
length,  tho  message  length  distribution,  tho 
•  channel  error  characteristics,  block  overhead, 
and  the  acknowledgment  overhead  arc  known. 

What  is  the  optimal  message  block  size  that  mini¬ 
mizes  the  time  wasted  in  acknowledgments, 
retransmissions,  and  the  last  unfilled  block? 

A  mathematical  model  is  developed  in  tills 
paper  to  treat  this  problem.  Tho  model  con¬ 
siders  two  types  <vf  error  channels:  1)  Random 
error  channel;  that  is,  he  errors  arc  generated 
in  a  statistically  independent  fashion  and  the 
error  rate  can  be  approximated  as  a  linear 
function  of  tho  block  size,  and  2)  Burst  error 
.  channel;  that  is,  tho  errors  arc  generated  in  a 
statistically  dependent  fashion  such  as  tho  noise 
produced  by  radio  static  or  switching  transients, 
and  the  error  rate  is  a  nonlinear  function  of 
block  size.  In  general,  error  characteristics 
can  be  obtained  only  from  actual  measurements 
[7}.  For  a  given  average  channel  error  rate, 
the  performance  of  the  burst  error  channel  is, 
in  general,  better  than  that  of  the  random 
error  channel  (0).  Using  our  mathematical 
model,  the  relationships  among  message 
length  (assuming  the  messages  arc  Tgeometri- 
cally  distributed),  transmission  rate,  acknowl¬ 
edgment  overhead,  block  overhead,  and  optimal 
block  size  arc  obtained  and  portrayed  in  graphs. 
These  graphs  and  the  model  should  bo  useful  as 
a  guide  or  tool  in  selecting  optimal  message 
block  oiz'c  for  planning  computer-communica¬ 
tion  systems. 


The  message  length,  L,  is  a  random 
variable  and  can  be  described  by  a  probability 
distribution  PjJL£l  with  average  length  l  char¬ 
acters  per  message.  When  the  message  is  par¬ 
titioned  into  a  fixed  size  block  of  B  characters 
per  block,  the  expected  number  of  blocks  per 
message  is  equal  to 

oO 

N(B)  -  £n.P,{(n-l)B  <£S  n)3}  J-1,2,..  (1) 
n^l 

The  structure  of  a  fixed  message  block 
consists  of  an  address,  b1#  in  the  front  of  the 
message  block  and  a  checking  code,  b2,  after 
the  message  block,  as  shown  in  Figure  1. 


i 


|  B  ■  FIXED  SI2E  MESSAGE  BLOCK 
|  b|  ■  ADDRESS 

b£*  CHECK  BITS 

i 

,  b  •  b,  ♦  b2  •  BLOCK  OVERHEAD 

Fig.  1.  Data  Structure  of  ft  Fixed  Size  Message 
Block. 

The  overhead  of  such  a  block  is  then  equal  to 
■  b  ■  bj  +b2  characters.*  Thus,  for  a  message 
block  size  of  B  characters,  the  whole  block 
length  is  equal  to  B  +  b  characters.  Let  E(B+b) 
be  the  probability  that  a  block  of  B  +  b  charac¬ 
ters  (a  message  block  of  B  characters  with  a 
block  overhead  b  characters)  transmitted  over 
a  channel  will  have  at  least  one  error.  Wo 
know  that  E(B+b)  is  dependent  on  both  the  error 
characteristics  of  tho  channel  and  the  whole 
block  size  B  +b.  Clearly,  a  larger  value  of 
B+b  and/or  a  noiser  transmission  channel 
yields  a  higher  value  of  E(B+b).  When  error 
•  detection  and  retransmission  arc  used  for  error 
control,  there  is  a  certain  amount  of  acknow¬ 
ledgment  delay.  A,  associated  with  each  mes¬ 
sage  block.  In  a  half  duplex  transmission  mode, 
the  acknowledgment  delay  should  also  include  tho 

'modem  turn-around  times  if  modems  arc  used  i 

(in  the  channel.  Thus,  the  expected  acknow.T 
lodgment  overhead  for  a  message  block 

;*We  assume  the  units  of  characters  for  consist¬ 
ency  and  allow  the  reader  to  insert  tho  number 
|  of  bits  per  character  to  fit  his  implementation. 
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transmitted  over  a  channel  la  equal  to  the  first 
ucknowled/jmcnt  time  plus  the  expected  retrans¬ 
mission  time  and  rcacknowlodgmcnt  time. 

Ah. summing  tliat  the  probability  of  error  of  each 
message  block  during  transmission  is  independ¬ 
ent  of  transmission  or  retransmission,  then 
mathematically  the  expected  acknowledgment 
overhead  for  a  message  block  of  size  B  (or 
whole  b.’nck  size  B+b)  on  a  channel  with  a  trans¬ 
mission  rate  11  charactcr/scc  la 

CO 

A(B  +  b)*=.A  +  ^  |e{B +b>p •( A  +  — (2) 

The  expected  wasted  time  due  to  acknow¬ 
ledgement  and  retransmission  in  transmitting  a 
message  in  fixed  sized  blocks,  Wi(B),  is  equal 
to  the  expected  number  of  blocks  (of  size  B+b) 
per  message  multiplied  by  the  expected  acknow¬ 
ledgment  overhead  of  each  block.  Thus, 

Wj(B)  -  N(E)  •  A(B+b) 

-N(B)*|A+^[E(B+b)Ji«(A+^~)|  (3) 

Since  0  s  E(B+b)  £  1,  B>  0,  and  b  >  0, 

£[E(B+b))1.^^  (4) 

Substituting  (4)  into  (3)  yields 

Wl(D)!N(B)-|A*a|^-.(A+a±j!)J  (5) 

The  expected  wasted  time  due  to  block  over¬ 
head  and  the  last  unfilled  partitioned  block  in 
transmitting  a  message  in  fixed  sized  blocks, 
W2(B),  is  equal  to  the  difference  botween  tho 
time  to  transmit  the  blocked  message  and  the 
unblocked  message. 

Thus 

W2(B)  -  N(B)  •  .  1+bl  (6) 

where  b'  is  tho  overhead  for  the  unblocked 
message. 

The  total  expected  wasted  timo  to  transmit 
a  message  in  blocks,  W(B),  is  equal  to  the  sum 
of  (&)  and  (G).  Thus 

We  wish  to  find  the  optimal  block  size  B°  that 
minimizes  (7);<  that  is, 


W(B°)  ■  min 
B 


E(B+b)  /.  B+b\ 

T-EtraAAT— ) 


(G) 


Let  us  assume  that  the  message  length  is 
geometrically  distributed;'*  thaLis,  P^(f)  «• 

■—  pq^"1,  iul,  2,  3, ... ,  Withaverage  message" 
length  i  ■  p_1  and  where  p  +  q  » 1.  Tho  average 
number  of  blocks  per  message  in  this  ease  eon 
bo  computed  from  (1)  and  is  equal  to 
“  ,  . 

N(B)»  Ij  n*  P  Un-1)B  <  J  £  nB>  4-1,2,.,. 
n-l  ;  _ - 


l- 1 


“»  nB 

■  Z n  *  Z  pq 

n=l  4-(n-l)B+l 


Z  n  •  a  -qB)  (qV"1 

n-1 


B. -1 

(1  -q  ) 


(9) 


Substituting  (9)  into  (7),  we  have 

B  +  b]  1  b* 

R  J-  “  "  “ 


l  R  J  pR  R 

«  . 

I  To  minimize  W(B),  wo  take  the  derivative  of 
j(10)  and  set  it  equal  to  zero, 

!  aw(B) 

SB 


(10) 


0. 


or 


(Where  E’  (B+b)  is  the  first  derivation  of  E(13+b). 

It  can  be  shown  that  the  second  derivative 
'of  W(B)  (Eq.  (10))  with  respect  to  B  that  satis¬ 
fies  (11)  is  positive.  lienee,  solution  of  (11) 
yields  the  minimum  W(B),  Due  to  the  complex¬ 
ity  of  (11),  a  closed  form  solution  of  B°  cannot 
be  obtained,  Tims,  numerical  techniques  arc 
used  to  solve  (11)  for  B  . 

Let  us  study  the  behavior  of  Equation  ( 1 1) 
and  denote  1 


X(B)»(q"B-l) 


♦Measurements  collected  from  several  time 
sharing  systems  revealed  that  the  message 
length  output  from  these  computers  can  be  ap¬ 
proximated  by  a  geometrical  distribution  [9]. 
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thus 

X(D)  -  Y(B)  -  0  (12) 

First,  let  us  evaluate  X(B)  and  Y(0)  at 
B  -  0, 

X^lli-O-0 

a»dy(D)|B=0«[l  +  ^J  |jnq|  OSqSl 

Thus,  Y(0)  5  X(0).  Further,  if  the  slope  of 
X(B)  is  greater  than  the  slope  of  Y(13)  for  all 
13  J:  0,  then  X(B)  intersects  Y(I3)  at  some  B, 

0  <  B  <»,  as  shown  in  Figure  2.  This  implies 


Fig.  2. 


dxo!> 

da 


dY03) 

dli 


and  Y(0)  >  X(0) 


that  \V()1)  is  a  convex  function  of  13.  Hence, 
the  optimal  block  cine  13  located  by  the  numer¬ 
ical  technique  attains  a  global  optimal. 

Next,  differentiating  X(B)  and  Y(B)  with 
.respect  to  13,  we  have 


d>:<15) 

dli 


E'(B+1>) 

l'-E(B+bT 


I 


where  E"(1J  +b)  is  the  second  derivative  of 
E(B+b). 

Since  the  error  probability  increases  as 
B  increases,  E'  (B  +b)a  0.  Also  since 
0  <  E(B+b)S  1,  the  second  term  of  (13)  is 

greater  than  — .  Further.'  if  E"(B+b)  £  0, 


tnen  the  first  term  of  (13)  is  positive. 
d\'(H)  _  dY(B)  ,  „  „  ... 

"dLi —  “  Ior  aU  Therefore, 


Thus, 


E"(B+b)  >  0  is  a  sufficient  condition  to  assure 
the  convexity  of  W(D).  The  physical  meaning 
of  E"(B  +  b)£  0  implies  that  E(B+b)  is  a  convex 
function  of  B  +b.  Comparing  Equations  (13) 
and  (14),  wo  know  that  even  if  E"(B+b)  <  0  for 

some  B,  -■  £  — -  might  still  be  satisfied. 

nicrcforc,  E"(B+b)  £  0  is  not  a  necessary 
condition  for  convexity  of  VV(B). 

In  the  following  we  shall  analyze  the 
optimal  message  block  size  for  two  types  of 
error  channels:  random  error  channel  and 
burst  error. channel. 


Random  Error  Channel 


For  a  random  error  channel,  the  prob¬ 
ability  that  a  block  of  B  +b  characters  trans  - 
xnlttcd  over  a  channel  will  have  at  loast  one 
,  orror  is 

E(B+b)  -  1  -(l-K)B+b  ,  . 


■  1 


-  [l-  (B+.b)K+ 


(B4b)(B+bTl)K2 

21 


(B+b)(B+h-l)(B+b-2)  „3 

i  "  31  K 

\  ■ 

'  where  K  equals  average  channel  character 
error  rate.  For  most  practical  systems, 

1  (B+b)* K«  1  (e.g.,  (B+b)K- 1000- 10"*- 0.1). 
j  Hence,  (15)  can  be  approximated  as 

j  E(B  +  b)  -  (B  +  b)K  (1C) 

I  which  is  linearly  proportional  to  the  total  block 
i  size  B+b.  Physically  (1C)  implies  that  E(B+b) 

,  is  approximately  equal  to  the  expected  number 
of  error  chai'actcrs  in  the  block  during  trans  - 
mission.  The  first  and  second  derivatives  of 
E(B+b)  equal  to  E»(B+b)=K  and  K"(B+b)  -  0. 

To  determine  the  optimal  message  block 
size,  we  substitute  E(B+b)  -K(B+b)  and 
E'  (B+b)  ■  K  into  (11).  We  have 
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Since  E"(n+b)  *  0,  \V(B)  ia  a  convex 
function  of  J3.  Rased  on  the  values  of  tlic  prod¬ 
uct  of  acknowledgment  delay  and  transmission 
rate  Alt,  I<,  and  H  =  {1 -q)  the  Newton - 

Hnphson' s  Iterative  Method  ( 10]  can  be  used  to 
solve  (17)  for  tlic  optimal  average  block  size  B°. 
The  iteration  is  terminated  when  the  improve¬ 
ment  on  \V(JJ)  from  each  new  13  is  less  than  10 
seconds  and  the  difference  between  the  value  of 
new  13  and  its  previous  13  is  less  than  0. 1  char¬ 
acters.  The  optimal  mossajjc  block  size  for 
selected  ranges  of  AR,  K,  i  and  b  are  por¬ 
trayed  in  Figure  3. 


...  (3a)  K  ■  10'4,  b  ■  0 


C£ 


(3b)  K  ■  10“6,  b  ■  0 


Fig.  3 ,  Optimal  Fixed  Message  Block  Size  for 
Random  Error  Channels  when  Message 
Length  is  Geometrically  Distributed. 


Burst  Error  Channel 


In  a  burst  error  channel,  the  error  tends 
to  cluster  rather  than  be  evenly  distributed 
over  the  messages.  For  example,  the  noise 
produced  by  radio  static  or  switching  transients 
may  cause  such  burst  errors.  Two  measured 
burst  error  channel  characteristics  [7]  are 
shown  in  Figure  4. 


MESSAOE  OlOCK  SIZE  D  «  k  (CHANS) 
t  CHANACTEN  ■  IP  BITS 


Fig.  4,  Burst  Error  Channel  Characteristics 
(Alexander  -Gryb  -Nost) 
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Iii  order  to  express  these  curves  mathemat¬ 
ically,  a  curve  fitting  technique  [10]  is  used  to 
represent  Kfihb)  as  a  polynomial.  A  good  tit 
with  mean  of  the  square  errors  B  0. 150x10"** 
for  the  GOO  bits/ sec  channel  is 

E  (B+b)  -  0,HC  -  O.ICO  [lo"10<B+b)] 

s-  0.045  [loK10<n«b)]2  US) 

la  the  same  manner,  a  good  fit  with  means  ot 
the  square  errors  ■  0.70x10  ®  tor  the  1,200 
bits/ sec  channel  is 

K„(n+b)--0. 145+0. 15011oc10(n+b))-O.  10C[lo£.0(a+b)]3 

+O.121xl0'1[lpitJ0(n+bS3  +0.243xl0‘2[log10(B+b)J4 

-0.  MflxlO^loc^tB+bJI5  +  0.2GCxl0’3[lo£10<B+b)]6 

•O.273xl0‘V«>h'10<B+b))7  -  0.  lC2xl0‘VoC10(«+bM° 

+O.23OxiO‘5[lo0lo(Ii+b)l°  • 

for  200  £  B+b  £  2xl03  chars  (19> 

The  (B+b)'  s  in  (10)  and  (19)  arc  in  charac¬ 
ters.  The  original  measured  results  are  in 
bits.  Here,  for  consistency,  we  represented 
tiiem  in  characters  with  a  scale  of  one  charac¬ 
ter  equaling  ten  bite. 

From  (10),  we  know  that  Ej(B+b)  >  0,  for 
all  11+b.  Thus,  the  W(B)  of  EjlB+b)  is  a  convex 
function  of  B.  From  (19),  E2(B+b)  <  0  for  some 
B.  Bonce,  we  need  to  compute  and  compare  Its 

and  •  We  find  that  for  the  range  of 

cili  ciLi  tj 

B  of  interest,  i.  o, ,  0  S  B  s  10  characters, 

2  •  Tliorcforc.  W(B)  for  E„(B+b) 

is  also  a  convex  function  of  B. 

In  the  same  manner  as  in  the  random  error 
ease,  we  substitute  (10)  into  (17)  and  then  (19) 
into  (17)  and  use  numerical  techniques  to  solve 
for  B°  for  each  of  the  transmission  rales.  The 
relationship  among  the  optimal  block  size,  * 
average  message  length,  b,  and  AH's  for 
Ejili+b)  and  122(B+b)  arc  portrayed  in  Fig.  5. 

When  — 5  arc  not  satisfied  for 

dB  dU 

some  regions  of  B,  numerical  results  might 
lead  to  a  local  optimal.  In  this  ease,  numer¬ 
ical  searcli  should  be  performed  in  these  re-  . 
gions  to  locate  the  local  optimals  of  each 
region.  The  global  optimal  block  size  B°  can 
then  be  selected  from  these  local  optimals. 


•Note  K^U+b)  and  E2(lHb)  are  represented  in 
characters  with  a  scale  of  one  character  equals 
ten  bits.  Should  the  message  not  agree  with  the 
scale  as  that  of  Ej(IJ+b)  and  E2(B-<b),  the  mes¬ 
sage  must  be  converted  into  the  same  scale 
(one  character  equals  ten  bits)  before  using 
*•  -g.  5  to  determine  the  optimal  block  size. 


ACKNOWLEDGMENT  DC  LAY  AND  TRANSMISSION  NATS 
FnOOUCT,  AN 


(5a)  E(B+b)*E j(B4b),  R*G00  bits/scc,  b«0 


RATE  FNODUCT,  AN  (CHAW 

« 


(5b)  E(B+b)«E2(B+b),  R-1200  bits/scc,  b-0'! 


NATE  FNODUCT,  AN  ICHAH) 

(5c)  E(B+b)»E2(B+b),  R-l,  200  bits/sec/  b*0 

Fig.  5,  Optimal  Fixed  Message  Block  Size  for 
Burst  Error  Channels  when  Message 
Length  is  Geometrically  Distributed. 


3.  DISCUSSION  OK  RESULTS 


The  optimal  block  size  D°  for  a  random  . 
error  channel  with  selected  average  message 
length,  block  overhead,  and  AH'  s  is  presented 
In  I>lg.  3.  Fora  specific  random  error  chan¬ 
nel,  the  optimal  block  size  Increases  with  the 
AH.  This  agrees  with  our  intuition,  i.  e. ,  if  A 
and/or  H  is  increasing,  (note,  A  is  independent 
of  li),  then  we  should  partition  the  message 
into  larger  blocks  so  as  to  reduce  the  number 
of  blocks  per  message.  - 

For  a  given  AH  and  average  message 
length,  the  optimal  block  size  is  larger  for  a 
smaller  error  rate  channel  than  for  a  larger 
error  rate  channel. 

For  a  burst  error  channel,  we  approxi¬ 
mated  Its  error  characteristics  by  a  poly¬ 
nomial.  In  the  same  manner  as  In  the  random 
error  channel  case,  we  substituted  E^B+b)  and 
E2(B+b),  respectively.  Into  (17)  and  numerically 
solved  for  the  optimal  message  block  size.  The 
B0'  a  for  selected  average  message  lengths  and-- 
selected  AH1  s  of  Ej(lHb)  and  E2(B+b)  are  pre¬ 
sented  in  Fig.  5.  For  small  AH  values,  the 
behavior  of  11°  is  similar  to  that  of  a  low  error 
rate  random  error  channel.  For  large  AR 
values,  the  behavior  of  13°  is  similar  to  that  of 
a  high  error  rate  random  error  channel.  This 
phenomena  is  due  mainly  to  the  fact  that  the 
burst  error  characteristics  havo  a  nonlinear 
effect  with  block  size. 

The  W(B)  for  a  random  cl*ror  channel  is 
always  a  convex  function.  The  W(B)'  s  for  a 
burst  error  channel  with  error  characteristics 
Ejdl+b)  and  E2(B+b)  are  also  convex  functions. 

In  general,  however,  the  W(B)  for  a  burst  error 
channel  is  not  necessarily  a  convex  function.  A 
few  typical  W(B)rs  are  shown  in  Fig.  6. 


Fig.  6.  Several  Topical  W(B)  vs  B 


We  noted  that  the  \V(B)'sj»rc  rather  Insensitive 
around  B°.  For  a  given  7.  AH.  and  channel 
error  characteristic,  the  W(B)  increases  as  the 
block  overhead  b  increases  as  shown  in  Fig.  7. 


-  ,?ig.  7-  Effect  of  W(B)  with  Bldfck  Overhead,  b. 

From  Figures  3c  and  5c,  wo  noted  that 
B°(b*0)  increases  as  b  increases,  and  tho 
amount  of  difference  between  B°(b=0)  and 
B°(b*0)  decreases  ns  AR  increases.  From 
Fig.  6,  we  noted  that  the  W(B)'  s  are  rather  _ 
insensitive  around  B°.  Therefore;  fof  a'systcm 
that  has  a  small  block  ovcrKcad  b,  and  a  large 
AR,  then  the  optimal  bldck  size  with  block  over- 
•hcad  is  approximately  equal  to  that  without  block 
overhead,  that  is,  B6(b*0)  ■  B°(b  ■  0). 

4.  EXAMPLE 

Consider  the  planning  of  a  computer  network 
that  consists  of  many  computers  and/or 

terminals.  These  computers  arc  remotely 
located  and  communicate  from  each  other  via 
communication  channels.  A  wideband  channel 
that  can  transmit  5,  000  characters  per  second 
is  used  between  each  pair  of  computers.  The 
channel  has  burst  error  channel  characteristics 
similar  to  E2<B+b)  as  shown  in  Fig.  4.  For  re¬ 
liability  in  information  transfer,  error  detection 
and  retransmission  arc  employed  in  the  system. 
Further,  the  message  is  partitioned  into  fixed 
size  blocks  for  ease  in  data  itandling  and  memory 
management.  The  block  overhead,  b,  is  approx¬ 
imately  equal  to  ten  characters.  The  acknow¬ 
ledgment  time  for  each  block  is  about  40  milli¬ 
seconds.  The  message  length  of  the  computer 
output  can  be  approximated  as  geometrically 
distributed.  We  would  like  to  consider  the 
optimal  block  sizes  for:  1)  average  message 
length  equal  to  500  characters,  and  2)  average 
message  length  equal  to  1,  000  characters. 


ft1'- 


The  acknowledgment -transmission  rate 
product  lb  equal  to  An-*10xl0"3x5xl0**>:2G0 
characters.  Solving  Equation  ( 11)  numerically, 
we  find  the  optimal  block  size  for  2  «  500  char¬ 
acters  is  37fi  characters,  and  from  Fig.  5c, 
the  optimal  block  size  for  2  “  1,000  characters 
is  527  characters.  Comparing  with  their 
optimal  block  sizes  for  b  *  0,  the  differences 
between  the  D°(b^0)  ar.d  E°(b-0)  are  within  ten 
characters. 
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ON  NON-BLOCKING  SWITCHING  NETWORKS  * 

DAVID  G.  CANTOR 

Abstract 

A  switching  network  »jy  be  informally  described 
as  a  collection  of  single-pole,  single -throw  switches 
arranged  so  as  to  connect  a  set  of  terminals 
inputs . to  another  set  of  terminals  called 
It  is  non-blocking  if,  given  any  set  of  connect ioqs 
from  some  of  the  inputs  to  some  of  the  outputs, 
idle  input  terminal  x  and  idle  output  terminal  y, 
then  it  is  possible  to  connect  x  to  y  without 
disturbing  any  of  the  existing  connections.  Denote 
by  o(a,b)  the  minimal  number  of  switches  necessary 
to  connect  a  inputs  to  b  outputs  using  a  non- 
blocking  network.  We  are  interested  in  studying  the 
growth  of  c(a,a^  as  a  -*  °°.  Results  of  C.  Clos 
show  that  o(a,a)  <  C  ae2  ^log  a’log  2  .  We  show 
that  o(a,a)  <  8a(log2  a)2. 


•This  work  was  support  in  part  by  the  Advanced  Research  Projects  Agency, 
Department  of  Defense  Contract  DAHC-15-69-C-0285 .  The  author  would  also 
like  to  thank  the  National  Science  Foundation  GP  #13164  and  the  Sloan 
Foundation  for  support  while  writing  this  paper. 
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ON  NON -BLOCKING  SWITCHING  NETWORKS 
DAVID  G.  CANTOR* 

1.  Introduction. 

A  Network  N  consists  of  a  graph  G;  two  sets  of  vertices 
^  of  G,  denoted  A  and  B  and  called,  respectively,  the  ( sets  of) 
inputs  and  outputs ;  and  a  set  P  of  paths  of  G.  Each  path  in  P 
connects  am  input  to  an  output  and  meets  no  other  inputs  or  outputs. 

We  write  N  =  (G,A,B,P).  A  state  of  N  is  a  subset  S  of  P  such 
that  no  two  paths  in  G  have  a  common  vertex.  A  state  S  defines 
a  bi jection  fg  from  a  subset  of  A  to  a  subset  of  B  as  follows : 
Suppose  p  €  S  and  p  connects  x  e  A  to  y  e  B;  put  fg(x)  =  y, 
and  repeat  this  for  each  path  in  S.  We  shall  say  that  a  path  p 
of  G  is  admissible  if  p  e  P.  If  x  is  a  vertex  of  G  we  shall 
say  that  x  is  busy  (in  the  state  S)  if  x  lies  on  a  path  p  e  S; 
otherwise  we  shall  say  that  x  is  idle  (in  the  state  S).  If  x 
is  an  input  of  G  and  y  is  an  output  of  G,  we  shall  say  that  x 
has  access  to  y  (in  the  state  S)  if  there  exists  a  path  p  e  P 
connecting  x  to  y  and  such  that  S  U  (p)  is  a  state. 

A  network  N  =  (G,A,B,P)  may  be  interpreted  as  a  switching  de¬ 
vice;  under  this  interpretation,  the  elements  of  A  are  considered 
as  input  terminals,  the  elements  of  B  are  considered  as  output  termi¬ 
nals,  and  the  edges  of  G  are  considered  as  single-pole,  single-throw 
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switches  which  are  normally  open.  Then  a  path  p,  which  connects 
x  €  A  to  y  e  B  may  be  thought  of  as  a  sequence  of  switches  which, 
when  closed,  connect  x  to  y.  The  state  S  yields  a  collection  of 
switches  (all  edges  on  any  path  in  S)  which,  when  closed,  connect 
inputs  to  outputs  as  described  by  the  function  fg. 

The  network  N  =  (G,A,B,P)  is  said  to  be  non-blocking  if 
given  any  state  S  of  N  and  idle  vertices  x  e  A,  y  e  B,  then 
x  has  access  to  y  in  the  state  S.  In  terms  of  the  switching 
network  interpretation  mentioned  above,  this  means  that  if  x  and 
y  are  idle  input  and  output  terminals,  respectively,  then  it  is 
possible  to  establish  a  connection  between  them  without  disturbing 
the  existing  connections . 


From  now  on,  all  the  networks  we  study  will  have  disjoint  in¬ 
puts  and  outputs  (i.e.  A  0  B  =  0  ). 

Given  positive  integers  a  and  b  we  are  interested  in  find¬ 
ing  those  non-blocking  networks  N  =  (G,A,B, P)  with  | A |  =  a,  |b|  =  b 
for  which  the  number  of  edges  of  G  is  minimal.  We  shall  denote  this 
number  by  o(a,b).  In  terms  of  switching  networks,  this  amount  to 
finding  non-blocking  networks  using  a  minimal  number  of  switches.  An 
obvious  non-blocking  network  with  a  inputs  and  b  outputs  is  the 
network  whose  graph  is  the  complete  bipartite  graph  on  vertex  sets  A 
and  B  with  | A |  =  a  and  |b|  =  b.  In  this  graph  the  set  of  vertices 
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is  A  U  B  and  there  is  an  edge  connecting  each  vertex  in  A  to  each 
vertex  in  B.  The  set  P  consists  of  all  paths  consisting  of  ex¬ 
actly  one  edge.  Thus  P  has  ab  elements.  In  the  switching  net¬ 
work  interpretation,  this  amounts  to  an  a  by  b  crossbar  switch. 

When  the  names  of  the  sets  A  and  B  are  unimportant,  we  shall  de¬ 
note  this  network  by  C  ,  .  The  network  C  ,  shows  that  o(a,b)  < 

ab  ab  — 

ab. 

It  was  Clos  [2]  who  showed  that  o(N,  N)  <  for  all  large  N. 

His  methods,  which  will  be  described  later,  show  that  cr(N,N)  < 
c  jfc2  ''dog  N)  •  ( log  2)^  We  win  ghQW  that  CT(NiN)  <  8N(log2  N)2. 

We  do  not  attempt  to  obtain  the  smallest  possible  constant  multi¬ 
plier,  for  it  is  not  clear  that  the  exponent  2  can  not  be  reduced. 

In  the  opposite  direction,  an  elementary  argument  shows  that 
cj(N,N)  >  C  N  log2  N,  and  nothing  stronger  is  known. 

The  author  would  like  to  acknowledge  many  stimulating  dis¬ 
cussions  with  Professors  B.  Gordon  and  C.  B.  Tompkins . 

2.  Constructions. 

We  shall  say  that  networks  N  =  (G,A,B,P)  and  N'  =  (G'jASBSP' ) 
are  isomorphic  (or  equivalent)  if  there  exists  a  graph  isomorphism 
.11  of  G  onto  G'  such  that  ^A)  =  A',  n(B)  =  B',  and  n(P)  =  P' . 

It  is  clear  that  the  property  of  being  non-blocking  is  preserved  under 
isomorphism. 
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If  N  =  (G,A,B,  P)  is  a  network,  we  define  its  transpose  Nf  to 
be  the  network  N*  =  (G,B,A,P);  clearly  N”  =  N. 

If  G  is  a  graph  and  C  is  a  set,  we  define  the  graph  G  x  C 
to  be  the  graph  whose  vertices  are  the  ordered  pairs  (x,c)  with  x 
a  vertex  of  G  and  c  e  C;  (( x^,c1),( Xg,c2) )  is  an  edge  of  G  x  C  if 

C1  =  C2  8111(1  ( xi*  X2^  is  an  e<3ge  of  G*  P  ls  a  in  G  whose 
vertices,  in  order,  are  xQ,x^, ...,xn  then  by  p  x  c  we  mean  the  path  in 

G  X  C  whose  vertices  are  (Xq,c),(x^,c), . . .,(xn,c) .  The  product  C  x  G 

is  defined  similarly. 

Now  suppose  Lj.  =  (G^,A^,Bj,P  )  (i  =  1  or  2)  are  networks; 
we  are  going  to  define  the  network  product  x  L^.  We  shall  denote 
this  product  by  N  =  (H,C,D,Q).  Put  C  =  A^  X  A g  and  D  =  B^  X 
The  graph  H  is  obtained  from  the  two  graphs  G1  x  A?  and  B^  x  G^  by 
identifying  the  vertices  in  B1  X  A?,  which  appear  in  both  graphs.  All 
admissable  paths  q  e  Q  of  N  are  obtained  as  follows :  Let  pi  e  Pi 
be  an  admissible  path  connecting  Xj^  e  A.^  to  yi  e  B^  (i  =  1  or  2). 

Then  p^^  X  x?  ends  in  the  vertex  (y1,Xg)  which  is  the  first  vertex 
of  y^  xp2<  The  path  q  =  (p1,p2)  is  defined  to  be  the  path  obtained 
from  the  paths  p^  x  x^  and  y^  x  p2  by  concatenating  them  and 
identifying  the  common  vertex  (y^x,,).  Note  that  this  maps  P1  x  P? 
onto  Q. 

In  the  switching  network  interpretation  this  construction  amounts 
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to  taking  j |  copies  of  L  and  ]  B^j  copies  of  L 2,  and  con¬ 
necting  the  outputs  of  each  of  the  copies  of  to  the  inputs  of 

all  of  the  copies  of  V  (see  Diagram  l). 

Let  a^b^c.jd  denote,  respectively,  the  cardinalities  of 
Ai'Bi,C'D'  ^  let  denote,  respectively  the  number  of  edges 

of  and  H.  The  following  relationship  between  two  by  two  ma¬ 
trices  is  easily  verified 


If  is  isomorphic  to  and  L 2  is  isomorphic  to 

it  is  easy  to  verify  that  L1  x  is  isomorphic  to  x  M^. 
Furthermore  (i^  x  Lg) •  =  x  L|.  Finally,  we  have  associativity: 
(L1  x  L2)  x  Lj  =  L1  x  (L2  x  1^);  we  will  usually  write  simply 
x  Lg  x  Lj.  We  will  abbreviate  the  k-fold  product 
LXLX-XL  by  Lk. 

We  also  define  a  triple  product  of  the  three  networks 
Li  =  (i  -  1,2,5)  when  T  be  a 

bisection  from  Aj  onto  B^;  the  triple  product  of  L^,L2,L^  de¬ 
pends  upon  the  choice  of  T  and  will  be  denoted  by  [L^,L2,L^]t. 

(in  many  cases  will  be  and  in  such  cases  we  will  choose  T 
to  be  the  identity  map.  In  any  case  those  properties  of  the  triple 
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The  lines  do  not  represent  edges;  they  connect 
output  vertices  of  to  the  injait  vertices 
of  Lg  with  which  they  are  identified. 

Figure  1.  L1  x  Lg 
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product  which  we  will  use  will  be  independent  of  the  choice  of  T  and 
we  will  frequently  write  [L^L^L^]  instead  of  )  Sup¬ 

pose  then  that  N  =  (H,C,D,  Q)  is 

We  put  C  =  A^  X  and  D  =  X  B^;  H  is  defined  as  the 
graph  obtained  from  the  three  graphs  G^  x  k^,  B.^  x  G^,  and 

G^  X  B^,  by  identifying  B1  x  Ag  in  G.^  X  Ap  with  B1  X  A 2  in 

B1  x  Gp,  and  by  identifying  A^  x  Bp  in  G^  x  Bp  with  t(A^)  x  Bp  - 

B^^  x  Bp  in  B1  x  Gp.  The  admissible  paths  q  e  Q  are  obtained  in 
the  following  way:  Let  p.^  be  an  admissible  path  of  connecting 
e  A ^  to  y1  e  B.^  (i  =  1,2,3)  and  suppose  t(x^)  =  y^.  Then 
V1  x  x2  ends  at  (y^Xg);  y±  X  begins  at  (y1,x2)  and  ends  at 
(y-^Yg);  and  p^  X  y2  begins  at  (x_,y2)  =  (y^g)-  The  path  q 

is  obtained  by  concatenating  p1  X  x2,  y1  X  p2,  p^  x  yg  and  identi¬ 

fying  the  vertices  common  to  two  segments  of  q. 

Note  that  [L^,L2,L^]  is,  in  general,  different  from 
L1  X  L2  x  (see  Diagram  2) . 

The  following  is  easily  verified;  we  omit  the  proof. 

THEOREM  2.1.  Suppose  L ^  =  (Gi,Ai,Bi,Pi)  (i  =  1,2,3, 4,5)  are 
networks .  Suppose  is  a  bisection  of  A^  onto  B^.  Then 

-« 

[l1,[l2,l5,l43t  ,l51t  =  [l^  X  L2,iyLu  X  l5]t  , 
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The  lines  do  not  represent  edges;  instead  they 
connect  vertices  which  are  to  be  identified. 


Figure  2.  [Lj,  LQt  Lj] 
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where  T,  is  the  bisection  from  A,  X  A,.  to  B,  X  B_  given  by 
j  4  5  J .  d 

=  ( T^( >  T^( a^) ) . 


3. 


The  Clog,  method,  and  some,  variations . 


The  basic  method,  due  to  Clos  [2]  and  quoted  by  Benes  [l]  may  be 
stated  as  the 


THEOREM  (Clos).  Suppose  L  =  (G,A,B,P)  is  non-blocking  and 

s  >  2r  -  1.  Then  N  =  [C  ,L,C  ]  is  non-blocking. 

—  -  rs  sr  — . .  . 

This  is  a  special  case  of  the  following  more  general 

THEOREM  3.1.  Suppose  L±  =  (G  ,A  jB^P^  (i  -  1,2,3)  are 
non-blocking,  that  |B1I  >  | A^ |  +  |b^|  -  1,  and  that  IB.J  =  |A^J. 
Then  =  [L^,L2,L]t  is  non-blocking  for  any  bisection  T  of 
onto  B1« 

Proof.  Suppose  =  (H,C,D,Q)  is  in  state  S,  and  that 
x  c  C,y  e  D  are  idle.  We  must  show  there  exists  a  path  q  e  Q  con¬ 
necting  x  to  y  and  having  no  common  vertices  with  any  path  in  S. 
Suppose  x  =  (ux,u2)  e  Ax  x  A2  and  y  =  (vj,v2)  e  B^  x  B2>  There 
are  |A^|  vertices  of  the  form  (u,u2)  e  A^  x  A2  and  at  most 
|A1|  -  1  of  them  are  busy.  Hence  at  most  |A1|  -  1  of  the  |B1| 
vertices  of  the  form  (y,u2)  e  B^  x  are  busy  and  hence  at  least 
|Bi J  -  |aJ  +  1  of  them  are  idle.  Denote  these  vertices  by 


119 


10 


(y^  j'XgJjCy^  *  ^2^  *  *  *  *  *  ,U2^*  ®o  that  r  >  |b..|  ”  l^i  I  ^  1»  Simi 

A1  J'2  r 

larly,  there  are  vertices  (zt  ,v2),...,(zi  ,\q)  in 

12  s 

AjXB2  which  are  idle,  and  s  >  |Aj|  -  |b^|  +  1.  The  r  +  s  ver¬ 
tices  y  ,y  ,...,y  ,  t(z  ),t(z  ),...,t(z  )  all  lie  in  R. 

1  2  xr  X1  x2  "s  1 

and 


r  +  s  >  |b1|  -  1^1  +  1  +  IbJ  -  |b5I  +  1 

>  |B1|  +  i  +  (IbJ  -  |aJ  -  |b5I  +  1) 

>  |b1I  +  1  . 


So  two  of  them  must  be  the  same.  Now  the  y.  are  all  distinct  and 

1J 

so  are  the  t(z  ).  Thus  there  must  be  a  y  equal  to  a  t(z  ), 

d  d  k 

say  yi  =  t(  z^  ) .  Since  1^  is  non-blocking  there  is  a  path  p1 
connecting  ^  to  yi  and  such  that  p1  x  Ug  has  no  common  ver¬ 
tices  with  any  vertex  in  S.  Similarly  there  is  a  path  p2  from 
u2  to  v2  in  Pg  such  that  y^  x  p2  has  no  common  vertices  with 
any  path  in  S,  and  there  is  a  path  p^  from  z^  to  v ^  in  P^ 

such  that  Pj  x  Vg  has  no  vertex  in  common  with  any  path  in  S. 

Let  q  be  the  concatenation  of  p1  x  Ug,  y^  x  p2,  and  p^  x  Vg 
with  the  appropriate  vertices  identified.  Then  q  connects  x  to 
y  and  S  U  {q}  is  a  state  of  N^. 
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REMARK  3.2.  Suppose  aA  =  |Ai|i  bi  =  |Bi|  and  is  the 
nuniber  of  edges  of  (i  =  1,2,3).  It  is  easy  to  verify  using  (l) 
that  N  =  [L^L^L^]  has  a^a2  inputs,  bgb^  outputs  and  that  its 
graph  has  agg1  +  b^  +  bgg^  edges. 

Clos  [2]  suggests  using  metworks  which  nay  be  described  as 

[L,  [L,  [L, . . .,  [L,M,L‘  ],L'  ],L*  ],...,!•] 

where  L  =  C  _  ,  and  M  =  C  .  By  Theorem  2.1,  this  is  the  same 
as  [L*,M,(L' )*],  where  l'1  =  LX  1XLX  •••  X  L  (t  times).  He 
shows  that  this  non-blocking  network,  which  has  nt+^  inputs  and 
outputs,  has 


5  t(5n  -  3)(2n  -  l)t_1  -  2nt] 

n  -  l 

edges.  This  follows  immediately  from  the  above  remark.  It  is  easy 

I  % 

to  verify  that  a  non-blocking  network  with  N  inputs  and  outputs, 

2  ‘J  log  N *  log  2 

constructed  by  this  method,  will  require  at  least  CQ  e 
edges,  where  CQ  >  0  is  a  constant. 
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Suppose  that  denotes  a  network  with  a  inputs,  b  out¬ 

puts,  and  whose  graph  contains  a  minimal  number  of  edges,  namely 
o(a,b).  Using  two  copies  of  L  shows  that  a{ a,  2a)  <  2<r(a,a).  Hy 
Theorem  3.1,  [L  ,L  ,L  ]  is  non-blocking  and  by  Remark  3.2, 

C&  ft^  &  C.Gij  & 

it  has  <  ao(a,2a)  +  2ao(a,a)  +  aa( 2a, a)  <  6ao(a,a)  edges.  Thus 
(2)  o( a2,a2)  <  6ao(a,a)  . 

logp  6 

Iteration  of  (2)  shows  that  o{N,N)  <  C  N(log  N)  .  This  result  can 

be  improved  by  considering  [L  0  ,L  _  ,L.  ] ;  this  network  has 

ab  inputs,  2ab  outputs  and  its  graph  has  3bo(a,2a)  +  2ao(b,2b) 
edges.  This  shows  that 


(3)  o(ab,2ab)  <  3ho(k>2a)  +  2ao(b,2b) 


log2  5 

Putting  a  =  b  and  iterating  (3)  shows  that  o( a, 2a)  <  C  a(logg  a) 
and  since  cr(a,a)  <  o(a,2a)  we  find  that 


(4) 


log  5 

o(N,  N)  <  C  N  (logg  N ) 


The  exponent  logp  5  can  be  decreased  by  choosing  a  and  b  differently, 
let  a  >  1  and  0  >  2  be  the  real  solutions  of  the  simultaneous  equa¬ 
tions 
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(4) 


{ 


cP'1 

(a  -  i)P-1 


5 

3/2 


} 


Numerical  computation  shows  that  a  —  2.37638  and  0  04  2.26922. 

Multiplying  the  second  equation  of  (4)  by  a  -  1  and  substituting 
from  the  first  yields  2(a  -  l)^  =  oP  -  3  or  equivalently 

(5)  3(l/a)P  +  2(1  -  1/af  =  l  . 

a 

We  now  show  that  if  p(x)  =  (log  x)  ,  then  p(x)  satisfies  the 
functional  equation 

(6)  n(z)  =  3n(x)  +  2p(y) 

where  x  =  z^a  and  y  =  z/x.  Indeed, 

3n(x)  +  2p(y)  =  3( ( log  z)/af  +  2(1^'  zf  (l  -  l/af 

=  (log  z)P 

=  U(z)  i 

using  (5). 

Now  cr(xf2x)/x  satisfies  a  functional  inequality  similar  to 
(6)  where  x  and  y  must  be  integers.  It  follows  that  for  each 
e  >  0,  there  exists  C£  >  0  such  that 
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a{  N,  2N)  <  C£  tflog  N)P+£  . 

For  comparison,  log^  5  =  2.32193 • 

4.  The  exponent  is  <2. 

Suppose  L  =  (G,A,B, P)  is  a  network  (not  necessarily  non¬ 
blocking).  We  shall  say  that  L  is  ef  type  ^j[m,n)  if,  given  any 
state  S  of  L  and  m  idle  inputs  X..X-,  ...>x  of  L,  then 

L  e  ID 

each  has  access,  in  the  state  S,  to  at  least  n  outputs  of  L. 

LEMMA  4.1.  Suppose  L  *  (G,A,B,  P)  is  of  type  T(m,m  +  n  -  l) 
for  1  <  m  <  k,  that  M  is  a  non-blocking  network  with  c  inputs 
and  d  outputs,  and  that  nd  >  a(  c  -  l) .  Then  Lx  M  is  of  type 
T(m,m  +  n'  -  l)  for  1  <  m  <  k  where  n*  =  nd  -  a(c  -  l)  and  a 
is  the  number  of  Inputs  of  L. 

Proof.  Take  k<m  idle  inputs  z^,z2, . . . ,z^.  Suppose,  for 
example,  that  Z±>Z2>  •  •  •, zk,  are  of  the  form 

(x1^y1)^(x2^y1),*«*,(xk.,y1)  > 

and  ^^l*  Zk!+2'  '  *  *,zk  are  of  the  form  ^Vyi^  where  i^2>  here 
the  x.  are  inputs  of  L  and  the  y.  are  inputs  of  M.  By  hypothesis, 
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^Xl,yl^  1148  access  to  at  least  n  +  k‘  -  1  vertices  of  the  form 
( u j , y j )  where  the  are  outputs  of  L.  Since  M  is  non-blocking, 
these  have  access  to  all  idle  vertices  of  the  form  (uj,v^)  where 

v^  is  an  output  of  M.  There  are  (n  +  k'  -  l)d  such  vertices.  How¬ 
ever,  as  many  as  (c  -  l)a  -  (k  -  k1)  of  these  could  be  busy;  this 

would  be  the  case  if  all  inputs  of  the  form  (x^y^,  where  i  >  2, 

other  than  z^,^,  Zlc'+2,  *  *  *,Zn  were  l>UBy*  Thus  z 1  has  access  to 
at  least 


(n  +  k'  -  l)d  -  (c  -  l)a  +  (k  k')  >  nd  -  (c  -  l)a  +  k  -  1 


=  ns  +  k  -  1 


output  terminals  of  L  X  M. 

The  following  theorem  provides  the  motivation  for  defining 
the  notion  T(m,n). 

THEOREM  4.2.  Suppose  M  is  a  non-blocking  network  and  L  is 
a  network  with  a  Inputs,  b  outputs,  and  of  type  T(l,n).  If 
2n  >  b,  then  [L,M,L*  ]  is  non-blocking. 

The  proof  is  similar  to  that  of  Theorem  3.1  and  will  be 
omitted. 

Now  choose  an  integer  k  >  1  and  put  =  Cg  x  C2~2  5 
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if  j  <  k,  then  Lj  has  2"  Inputs,  k*2^  outputs,  and  inductively 
by  Leona  4.1,  L^  is  of  type  TU^^ak  -  j))  and 
T( 2, 2^-1( 2k  -  j)  +  1).  Thus  1^  is  of  type  T(2,k2*_1  +  l).  Let  ^ 
be  obtained  from  L^  by  omitting  one  input.  Then  has  2^-1 

inputs  k*2k  outputs,  is  of  type  T(l,k*2k-1  +  l),  and  its  graph 
has  no  more  edges  than  the  graph  of  L^.  The  associated  matrix  of 


2  0  W2  0  xk-1 

4k  2k  '  '  4  2  ' 


=  2 


1  0 
2k2  k 


)  ■ 


k  2 

Thus  has  2  *2k  edges  and  if  N  is  any  non-blocking  network, 
then  by  Theorem  4.2,  so  is  [M^,N,M£j.  Thus  putting,  for  example, 

N  “  C2  2’  we  obtaln  a  non-blocking  metwork  with  (2  -  2)  in- 

puts  and  outputs  whose  graph  has  <2  (4k  +  2k)  edges.  It  is  im¬ 

mediate  that  cr(N.,N)  <8N(logg  N)2  for  all  N  >  2.  It  is  not  hard 
to  see  that  the  constant  8  could  be  considerably  decreased,  but 
the  major  open  question  is  the  value  of  the  exponent. 
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DELAY  IN  OCMIJICAXIOJ  AND  OTPUIER  NFl'.JOKKS* 

Leonard  Klein  rode 
Ocrputer  Science  Department 

University  of  California,  Los  Angeles,  California  90324 


I.  r.TRODUCTICN 

Delay  in  oomrunicatien  and  computer  networks  has 
reocntly  tcocre  a  subject  of  considerable  interest. 

In  this  F-Per  we  address  ourselves  to  the  topics  of 
analysis  and  optimization  of  such  nets.  Ihose  we 
consider  are  of  the  store-and-fcr.:ard  type  more 
cozrcnly  known  as  message-switching  networks. 

The  problem  confronting  the  network  designer  is 
to  create  a  system  v.hich  provides  suitable  network 
performance  at  an  acceptable  system  cost.  Since,  in 
message-switched  networks  the  messages  experience 
queueing  delays  as  they  pass  from  node  to  node,  the 
performance  measure  is  usually  taken  to  be  the  speed 
at  which  res  sages  can  be  delivered.  The  optimiza¬ 
tion  problem  is  to  achieve  minimal  average  delay  at 
a  fixed  network  cost  by  appropriately  choosing  the 
network  topology,  the  channel  capacity’  assignment, 
and  the  message  routing  procedure.  The  purpose  of 
this  paper  is  to  review  sane  of  the  methods  for 
handling  various  aspects  of  this  problem. 

II.  ANALYTICAL  TOOLS 

Hie  appropriate  tools  are  those  which  have  devel¬ 
oped  from  queueing  theory. 

11.1.  Single  Server  Systems.  Mjch  of  queueing 
thpery  considers  systems  in  which  messages  (custo¬ 
mers) 'place  demands  for  transmission  upon  a  single 
cannunicaticn  channel  (the  single  server) .  When  the 
average  demand  for  service  is  less  than  the  capacity 
of  the  channel  to  handle  these  darands,  the  system 
is  said  to  be  stable.  The  literature  on  stable  sin- . 
gle  server  queueing  systems  is  fairly  voluminous  as 
for  example  exemplified  by  the  excellent  work  by 
Ochen  (1].  Single  server  systems  are  characterized 
by  A(t) ,  the  distribution  of  in  ter  arrival  times  and 
B(x),  the  distribution  of  service  tires.  In  tha 
case  when  A(t)  is  exponential  (i.e.  Poisscn  arriv¬ 
als)  ,  then  the  literature  contains  fairly  complete 
results.  However,  when  both  A(t)  and  B(x)  are  arbi¬ 
trary,  then  the  situation  becomes  much  more  complex 
and  only  weak  results  are  available. 

Recently  attention  has  been  directed  to  develop¬ 
ing  approximate  solution  methods.  These  methods  in¬ 
clude:  placing  bounds  on  the  behavior  of  the  sys¬ 
tem;  studying  the  system  behavior  under  light  and 
heavy  traffic  conditions;  and  by  forming  diffusion 
approximations  to  the  physical  queueing  systems. 

This  last  approach  appears  most  premising  and  in¬ 
volves  replacing  a  discrete  randan  process  with  a 
continuous  random  walk  typically  with  a  reflecting 
barrier  at  the  origin  to  prevent  queue  sizes  and 
waiting  times  from  going  negative  (see  Gaver  (2]) . 
Numerical  results  which  have  been  obtained'  using 
the  diffusion  approximation  have  been  startling  in 
terms  of  their  accuracy  when  oerpared  to  the  origi¬ 
nal  queueing  probloi. 

11. 2.  Multiple  Nodes  and  Networks.  The  case  of 
interest  to  uas  pupet  is  tot  of  multiple  nodes  in 
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a  network  environment.  The  queueing  problems  en¬ 
countered  in  networks  axe  far  more  difficult  than 
single  server  problems.  The  difficulty  arises  due 
to  new  phenomena  which  occur  in  networks,  the  mest 
important  of  which  is  that  traffic  entering  a  no:' a 
in  the  network  is  dependent  upon  traffic  elsewhere 
in  the  network  and  on  other  nodes  through  v.hich-  this 
traffic  has  passed.  This  difficulty  manifests  it¬ 
self  in  that  A(t)  for  a  network  node  is  no  longer 
exponential.  A  second  difficulty  is  the  phenomenon 
of  blocking  which  occurs  when  the  finite  storage  ca¬ 
pacity  of  a  node  becanes  filled  and  the  further  re¬ 
ception  of  messages  is  terrporaril'-  prohibited.  This 
then  places  a  burden  on  neighboring  nodes  and  they 
too  tend  to  get  blocked  causing  the  effect  to  propa¬ 
gate  in  the  network.  This  effect  is  probably  cr.e  of 
the  least  understood  queueing  effects  in  the  study 
of  nets  and  has  significant  impact  upon  perforrer.ee. 

The  problem  in  which  customers  are  permitted  to 
move  among  a  collection  of  queueing  stations  in  seme 
random  fashion  was  studied  by  Jackson  [3] .  His  ma¬ 
jor-result  was  to  show,  when  the  system  is  stable, 
that  each  node  in  the  system  could  indeed  be  a:.’- 
lyzed  as  a  single  queueing  facility  (under  Marks  ian 
assumptions) .  This  represents  perhaps  one  of  the 
first  successful  attempts  at  decc-rcsir.g  a  network 
problem  into  a  series  of  simpler  single  node  problems . 
Another  fundamental  result  which  permits  deoenposi- 
tion  of  cueueing  networks  is  due  to  Burke  (4] ;  he 
showed  that  if  A(t)  and  B(x)  are  cwynential,  then 
the  departure  times'  are  also  exponentially  distri¬ 
buted.  Thus,  we  preserve  the  Poisson  nature  of  the 
traffic  flow  between  network  nodes . 

The  results  referred  to  in  the  previous  paragraph 
do  not  carry  over  trivially  into  massage  oriented 
ccmmmication  or  ocrputer  nets  since  messages  main¬ 
tain  their  lengths  as  they  pass  through  the  net.  The 
first  comprehensive  treatment  of  carmunication  r.ats 
was  carried  out  by  Kleinrock  (5).  Fortunately,  it 
aould  be  shewn  for  a  wide  variety  of  cannunicaticn 
nets  that  it  was  possible  to  introduce  an  assump¬ 
tion  which  once  again  permitted  a  deexipositien  of 
the  network  into  a  collection  of  single  nedes. 


Using  the  simple  structure  of  the  linear  equa¬ 
tions  of  motion  governing  Markovian  queues,  Wallace 
[6]  has  developed  a  procedure  for  solving  the  system 
of  equations  numerically. 


III.  OPTLMIZATICN  TOOLS 

Perhaps  the  first  oomuhi  cations  network  optimiza¬ 
tion  problem  was  posed  and  solved  by  Kleinrock  (5] 
in  which  he  assumed  that  the  network  topology  au-d 
the  channel  traffic  were  known  quantities.  Also,  ho 
assumed  that  the  traffic  was  Markovian  (roissth  ar¬ 
rivals  and  exponential  message  lengths)  and  justi¬ 
fied  certain  decomposition  assertions.  •  For  each 
ChJTnwl  Kh-->  Br*  tivii  asSJjyss-en*  of  oanarilgf  Ci  was 
found  which  run  uni  zed  the  average  network  daisy  T  to 
messages,  at  a  fixed  total  system  cost  D.  We  define: 

as  the  average  queueing  plus  transmission  tiro  on 


the  i^1  channel;  as  the  average  message  traffic 
on  the  1th  channel;  y  as  the  average  network  traf¬ 
fic  throughput;  and  as  the  oost  factor  on  the  i**1 
channel.  Hus  problem  takes  the  following  form: 

Problem  A;  Choose  the  set  of  channel  capacities, 

q,  to  minimize  T  at  fixed  cost  D  where 

T  -  £  (VriTi  (1);  D  =  £  dfr  (2) 

The  solution  to  this  problem  assigns  a  capacity  to 
the  i"1  channel  in  an  amount  equal  to  the  average 
traffic  carried  plus  an  excess  capacity  proportional 
to  the  square  root  of  that  traffic.  It  may  be  ob¬ 
served  that  a  related  capacity  assignment  (namely, 
that  which  gives  capacity  directly  in  proportion 
to  traffic  carried)  provides  an  average  message  de¬ 
lay  not  significantly  worse  than  the  optimum. 

These,  and  other  related  results,  were  published 
as  Kleinrock's  Ph.D.  thesis  (MIT)  in  1962  (this  work 
later  appeared  as  IS] ) .  Little  was  published  in 
this  field  from  then  until  1969  [7] .  Whatever  the 
reason  for  this  inactivity,  it  is  clear  that  the  re¬ 
cent  interest  is  due  to  the  development  of  computer 
networks.  In  1967  Roberts  [8]  proposed  the  idea  of 
an  experimental  computer  network  which  later  devel¬ 
oped  into  the  Advanced  Research  Pro*  sets  Agency 
(AFPA)  oenputer  network  (recently  reported  upon  in 
the  1970  SJCC  Proceedings) . 

In  a  forthcoming  paper  by  Meistnr  et  al.  19) ,  it  . 
is  observed  that  in  minimi zirij  T  in  Prcblsn  A  above, 
certain  of  the  channels  produce  rather  large  and  un¬ 
desirable  message  delays  Tj_.  As  a  result,  'Bister 
et  al.  pose  the  fpllowing  problem,  whose  solution 
*  is  closely  related  to  that  of  Problem  A: 

Problem. B:  sane  as  PrvJblau  A  except  T  is  given  by 

T-  [I  (Ai/r)Tik]1/k  (3) 

By  raising  q  to  the  k**1  power  they  find  that  for 
k  >  1,  one  forces  a  reduction  in  the  variation 
among  the  T: .  For  k  ■*  •»  the  minimization  yields  a 
constant  value  for  q.  When  k  =  0,  the  assignment 
reduces  to  the  proportional  channel  capacity  assign- 
sent.  The  amazing  observation  is  that  T  increases 
very  slowly  as  k  grows  from  unity.  Moreover ,  they 
show  that  the  variance  of  message  delay  is  mini¬ 
mized  when  k  is  chosen  equal  to  2. 

In  Ref.  [7]  Kleinrock  introduced  sere  first  at¬ 
taints  at  modelling  computer  nets  and  was  able  to 
show  that  simple  models  were  extremely  useful  in 
predicting  the  behavior  of  the  message  delay  in  the 
AR?A  oenputer  net.  In  a  subsequent  paper  110]  he 
introduced  the  following  variation  to  Problem  A 
since  the  oost  function  as  given  in  Eq.  (2)  was 
found  not  to  represent  tariffs  for  high  speed  tele¬ 
phone  data  channels: 

Problem  C:  Same  as  Problem  A  exoept  D  is  given  by- 

D  -  E  diq0  (4) 

where  0  s  a  £  1.  The  solution  to  Problem  C  cannot 
Hi  given  in  c'ortJ  Poim.  in 

a  numerical  solution  of  this  problem  to  the  AFPA  not 
it  was  found  that  the  massage  delay  varied  insignifi¬ 
cantly  with  a  for  .3  i  a  &  1.'  This  indicates  that 
the  closed  form  solution  to  Problem  A  may  serve  as 


an  approximation  to  the  more  difficult  Problem  C. 

IV.  ADDITIONAL  gNSICTPATICKS 

Minimizing  cost  at  fixed  average  ires  sage  delay 
by  appropriately  choosing  channel  capacity  is  the 
dual  for  problems  A,  B,  and  C.  This  was  studied  in 
TTOT  and  considered  recently  by  '..r.itney  £111  -  Choioe 
of  ne*-work  tcpolocy  was  considered  in  Kleinrock's 
origiial  work.  Recently  Frank  et  al.  [12]  consid¬ 
ered  this  problem  for  the  AR?A  net  and  developed 
suboptimal  search  procedures.  They  also  addressed 
the  problem  of  chocsing  an  optimal  channel  assign¬ 
ment  when  capacities  must  be  chosen  from  a  finite 
set;  Whitney"  [11]  and  Doll  {13]  considered  this 
problem  for  a  fixed  tree  topology.  Frank  et  al.  (14] 
devised  an  cpti-al  procedure  for  selecting  discrete 
channel  capacities  for  centralized  computer  networks. 

Message  routing  procedures  must  also  be  consid¬ 
ered.  Of  all  those  so  far  discussed,  this  problem 
lends  itself  least  to  analysis.  Lastly,  we  note 
that  the  ultimata  standard  in  these  problems  is 
measurement  of  real  systems.  Ibis  is  receiving 
considerable  attention  in  the  AKPA  net.' 

v.  crrcLcsicN 

The  attempt  in  this  paper  has  been  to  describe 
and  to  evaluate  various  tools  for  studying  delay  in 
oermuni cation  and  computer  nets.  These  tools  must 
be  considerably  improved.  Nevertheless,  they  have, 
been  useful  in  network  studies.  Among  the  most  dif¬ 
ficult  remaining  problems  we  mention  the  blocking 
effect  due  to  finite  storage  capacity,  the  analyses 
of  i.uuhi.*)  ^rcnjedoits,  and  the  ILsigii  of  network 
topologies. 
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